Patents by Inventor Dahua Lin

Dahua Lin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND UNIFIED FRAMEWORK SYSTEM FOR FULL-STACK AUTONOMOUS DRIVING PLANNING

Publication number: 20240193793

Abstract: The present application provides a method and a unified framework system for full-stack autonomous driving planning. The method comprises: acquiring an image of a scene and converting the image into an image feature, and converting the image feature into a bird's-eye view feature map; detecting agents from the bird's-eye view feature map through track query vectors, and continuously tracking the agents; segmenting different types of map elements from the bird's-eye view feature map through map query vectors, and continuously updating the map elements; predicting a future trajectory of each agent using an interaction between the agents and the different map elements; predicting, according to the predicted future trajectory of each agent, an occupancy grid map over multi-steps into the future; and decoding an ego-vehicle query vector to generate a planned path of an ego-vehicle, and optimizing the planned path using the predicted future multi-step occupancy grid map.

Type: Application

Filed: April 25, 2023

Publication date: June 13, 2024

Applicant: Shanghai Artificial Intelligence Innovation Center

Inventors: Hongyang LI, Li CHEN, Jiazhi YANG, Yihan HU, Chonghao SIMA, Tianyu LI, Lewei LU, Yu LIU, Qiang LIU, Junchi YAN, Dahua LIN, Yu QIAO, Xiaogang WANG
Image tagging based upon cross domain context

Patent number: 11544588

Abstract: A method described herein includes receiving a digital image, wherein the digital image includes a first element that corresponds to a first domain and a second element that corresponds to a second domain. The method also includes automatically assigning a label to the first element in the digital image based at least in part upon a computed probability that the label corresponds to the first element, wherein the probability is computed through utilization of a first model that is configured to infer labels for elements in the first domain and a second model that is configured to infer labels for elements in the second domain. The first model receives data that identifies learned relationships between elements in the first domain and elements in the second domain, and the probability is computed by the first model based at least in part upon the learned relationships.

Type: Grant

Filed: April 3, 2019

Date of Patent: January 3, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Simon John Baker, Ashish Kapoor, Gang Hua, Dahua Lin
NETWORK TRAINING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

Publication number: 20220327385

Abstract: The present disclosure relates to a network training method, an electronic device and a storage medium. The network training method includes the following steps. At least one implicit vector may be input into at least one pre-trained generative network to obtain a first generated image; the generative network may be obtained with a discriminative network through adversarial trainings with a plurality of natural images. A degradation process may be performed on the first generated image to obtain a first degraded image of the first generated image. The implicit vector and the generative network may be trained according to the first degraded image and a second degraded image of at least one target image; the trained generative network and the trained implicit vector may be used to generate at least one reconstructed image of the target image.

Type: Application

Filed: June 29, 2022

Publication date: October 13, 2022

Inventors: Xingang PAN, Xiaohang ZHAN, Bo DAI, Dahua LIN, Ping LUO
OBJECT RECOGNITION METHODS AND DEVICES, AND STORAGE MEDIA

Publication number: 20220222922

Abstract: Object recognition methods and devices, and storage media are provided. In one aspect, an object recognition method includes: acquiring to-be-processed point cloud data and processing the to-be-processed point cloud data to obtain target point cloud data, the to-be-processed point cloud data including point cloud data of an to-be-recognized object; recognizing the to-be-recognized object from the target point cloud data and determining a target feature of the to-be-recognized object; and determining, according to the target feature, a target category to which the to-be-recognized object belongs among a plurality of categories to obtain a recognition result for the to-be-recognized object. The recognition result includes at least the target category.

Type: Application

Filed: April 1, 2022

Publication date: July 14, 2022

Inventors: Xinge ZHU, Tai WANG, Yan XU, Jianping SHI, Dahua LIN
Methods and apparatuses for deep learning-based recommendation, electronic devices, and media

Patent number: 11379901

Abstract: Methods and apparatuses for deep learning-based recommendation, electronic devices, and media include: respectively obtaining related information of a target user and related information of a target item; respectively using at least two sub-models in an integrated model to obtain, based on the related information of the target user and the related information of the target item, operating probabilities corresponding to the at least two sub-models; obtaining, based on the operating probabilities corresponding to the at least two sub-models, a target probability about the target user operating the target item; and recommending the target item to the target user based on the target probability.

Type: Grant

Filed: June 26, 2019

Date of Patent: July 5, 2022

Assignee: BEIJING SENSETIME TECHNOLOGY DEVELOPMENT CO., LTD

Inventors: Zhiyu Min, Dahua Lin
METHOD FOR ISOLATING SOUND, ELECTRONIC EQUIPMENT, AND STORAGE MEDIUM

Publication number: 20220130407

Abstract: Input sound spectra are acquired. The input sound spectra include sound spectra corresponding to multiple sound sources. Predicted sound spectra are isolated from the input sound spectra by performing spectrum isolation processing on the input sound spectra. Updated input sound spectra are acquired by removing the predicted sound spectra from the input sound spectra. Next isolated predicted sound spectra continue to be acquired through the updated input sound spectra, until the updated input sound spectra include no sound spectrum.

Type: Application

Filed: January 6, 2022

Publication date: April 28, 2022

Inventors: Xudong XU, Bo DAI, Dahua LIN
Anchor determination method and apparatus, electronic device, and storage medium

Patent number: 11301726

Abstract: An anchor determination includes: performing feature extraction on an image to be processed to obtain a first feature map of the image to be processed; performing anchor prediction on the first feature map via an anchor prediction network to obtain position information of anchors and shape information of the anchors in the first feature map, the position information of the anchors referring to information about positions in the first feature map where the anchors are generated. A corresponding anchor determination apparatus and a storage medium are also provided.

Type: Grant

Filed: April 21, 2020

Date of Patent: April 12, 2022

Assignee: BEIJING SENSETIME TECHNOLOGY DEVELOPMENT CO., LTD.

Inventors: Kai Chen, Jiaqi Wang, Shuo Yang, Chen Change Loy, Dahua Lin
Keyframe scheduling method and apparatus, electronic device, program and medium

Patent number: 11164004

Abstract: A key frame scheduling method and apparatus include: performing feature extraction on a current frame through a first network layer of a neural network to obtain low-layer features of the current frame acquiring a scheduling probability of the current frame according to low-level features of a previous key frame adjacent to the current frame and the low-level features of the current frame; determining whether the current frame is scheduled as a key frame according to the scheduling probability value of the current frame; and when determining that the current frame is scheduled as a key frame, performing feature extraction on low-level features of a current key frame via a second network layer of the neural network to obtain high-level features of the current key frame, where the network depth of the first network layer is less than the network depth of the second network layer.

Type: Grant

Filed: December 25, 2018

Date of Patent: November 2, 2021

Assignee: BEIJING SENSETIME TECHNOLOGY DEVELOPMENT CO., LTD.

Inventors: Jianping Shi, Yule Li, Dahua Lin
SEARCH METHOD AND DEVICE, AND STORAGE MEDIUM

Publication number: 20210326383

Abstract: A search method, a search device, a storage medium and a computer program. The search method includes: determining a first similarity between text and at least one video, the text being used for representing a search condition; determining a first character interaction graph of the text and a second character interaction graph of the at least one video; determining a second similarity between the first character interaction graph and the second character interaction graph; and according to the first similarity and the second similarity, determining a video matching the search condition from the at least one video.

Type: Application

Filed: June 29, 2021

Publication date: October 21, 2021

Applicant: BEIJING SENSETIME TECHNOLOGY DEVELOPMENT CO., LTD.

Inventors: Yu XIONG, Qingqiu HUANG, Lingfeng GUO, Hang ZHOU, Bolei ZHOU, Dahua LIN
Method and device for video processing, electronic device, and storage medium

Patent number: 11120078

Abstract: The present disclosure relates to a method and device for video processing, an electronic device, and a storage medium. The method comprises: determining, on the basis of paragraph information of a query text paragraph and video information of multiple videos in a video library, preselected videos associated with the query text paragraph in the multiple videos; and determining a target video in the preselected videos on the basis of video frame information of the preselected videos and of sentence information of the query text paragraph. The method for video processing of the embodiments of the present disclosure indexes videos by means of the relevance between the videos and the query text paragraph, allows the pinpointing of the target video, avoids search result redundancy, allows the processing of the query text paragraph in a natural language form, and is not limited by the inherent contents of content labels.

Type: Grant

Filed: August 6, 2019

Date of Patent: September 14, 2021

Assignee: BEIJING SENSETIME TECHNOLOGY DEVELOPMENT CO., LTD.

Inventors: Xiaoou Tang, Dian Shao, Yu Xiong, Yue Zhao, Qingqiu Huang, Yu Qiao, Dahua Lin
IMAGE PROCESSING METHOD AND DEVICE, AND NETWORK TRAINING METHOD AND DEVICE

Publication number: 20210279892

Abstract: An image processing method and a device, and a network training method and a device are provided. The image processing method includes determining a guide group arranged on an image to be processed and directed at a target object, the guide group comprising at least one guide point, and the guide point being used to indicate the position of a sampling pixel, and the magnitude and direction of the motion speed of the sampling pixel; and on the basis of the guide point in the guide group and the image to be processed, performing optical flow prediction to obtain the motion of the target object in the image to be processed.

Type: Application

Filed: May 25, 2021

Publication date: September 9, 2021

Inventors: Xiaohang ZHAN, Xingang PAN, Ziwei LIU, Dahua LIN, Chen Change LOY
METHOD AND APPARATUS FOR NEUTRAL NETWORK TRAINING, METHOD AND APPARATUS FOR IMAGE GENERATION, AND STORAGE MEDIUM

Publication number: 20210224607

Abstract: Provided are a method and apparatus for neutral network training and a method and apparatus for image generation. The method includes that: a first random vector is input to a generator to obtain a first generated image; the first generated image and a first real image are input to a discriminator to obtain a first discriminated distribution and a second discriminated distribution; a first network loss of the discriminator is determined based on the first discriminated distribution, the second discriminated distribution, a first target distribution and a second target distribution; a second network loss of the generator is determined based on the first discriminated distribution and the second discriminated distribution; and adversarial training is performed on the generator and the discriminator based on the first network loss and the second network loss.

Type: Application

Filed: April 2, 2021

Publication date: July 22, 2021

Inventors: Yubin DENG, Bo DAI, Yuanbo XIANGLI, Dahua LIN, Chen Change LOY
Image Processing Method and Device, and Storage Medium

Publication number: 20210209392

Abstract: The present disclosure relates an image processing method and device, and a storage medium. The method comprises: performing a feature equalization processing on a sample image by an equalization subnetwork of a detection network to obtain an equalized feature image of the sample image; performing a target detection processing on the equalized feature image by a detection subnetwork of the detection network to obtain predicted regions of a target object in the equalized feature image; determining an intersection-over-union of each of the predicted regions respectively; sampling the plurality of predicted regions according to the intersection-over-union of each predicted region to obtain a target region; and training the detection network according to the target region and a labeled region. The systems and techniques disclosed here can reduce information loss and improve the training effect and training efficiency.

Type: Application

Filed: March 23, 2021

Publication date: July 8, 2021

Applicant: Beijing Sensetime Technology Development Co., Ltd.

Inventors: Jiangmiao Pang, Kai Chen, Jianping Shi, Dahua Lin, Wanli Ouyang, Huajun Feng
Magnifying feature map

Patent number: 11049217

Abstract: An example of the present disclosure provides methods, apparatuses and devices for magnifying a feature map, and a computer readable storage medium. The method includes: receiving a source feature map to be magnified; obtaining N reassembly kernels corresponding to each source position in the source feature map by performing convolution on the source feature map, wherein N refers to a square of a magnification factor of the source feature map; obtaining, for each of the reassembly kernels, a normalized reassembly kernel by performing normalization; obtaining, for each source position in the source feature map, N reassembly features corresponding to the source position by reassembling features of a reassembly region determined according to the source position with N normalized reassembly kernels corresponding to the source position; and generating a target feature map according to the N reassembly features corresponding to each source position in the source feature map.

Type: Grant

Filed: December 15, 2020

Date of Patent: June 29, 2021

Assignee: Beijing Sensetime Technology Development Co., Ltd.

Inventors: Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin
MAGNIFYING FEATURE MAP

Publication number: 20210104015

Abstract: An example of the present disclosure provides methods, apparatuses and devices for magnifying a feature map, and a computer readable storage medium. The method includes: receiving a source feature map to be magnified; obtaining N reassembly kernels corresponding to each source position in the source feature map by performing convolution on the source feature map, wherein N refers to a square of a magnification factor of the source feature map; obtaining, for each of the reassembly kernels, a normalized reassembly kernel by performing normalization; obtaining, for each source position in the source feature map, N reassembly features corresponding to the source position by reassembling features of a reassembly region determined according to the source position with N normalized reassembly kernels corresponding to the source position; and generating a target feature map according to the N reassembly features corresponding to each source position in the source feature map.

Type: Application

Filed: December 15, 2020

Publication date: April 8, 2021

Inventors: Jiaqi WANG, Kai CHEN, Rui XU, Ziwei LIU, Chen Change LOY, Dahua LIN
Time domain action detecting methods and system, electronic devices, and computer storage medium

Patent number: 10915741

Abstract: Time domain action detecting methods and systems, electronic devices, and computer storage medium are provided. The method includes: obtaining a time domain interval in a video with an action instance and at least one adjacent segment in the time domain interval; separately extracting action features of at least two video segments in candidate segments, where the candidate segments comprises video segment corresponding to the time domain interval and adjacent segments thereof; pooling the action features of the at least two video segments in the candidate segments, to obtain a global feature of the video segment corresponding to the time domain interval; and determining, based on the global feature, an action integrity score of the video segment corresponding to the time domain interval. The embodiments of the present disclosure benefit accurately determining whether a time domain interval comprises an integral action instance, and improve the accuracy rate of action integrity identification.

Type: Grant

Filed: December 28, 2018

Date of Patent: February 9, 2021

Assignee: Beijing SenseTime Technology Development Co., Ltd

Inventors: Xiaoou Tang, Yuanjun Xiong, Yue Zhao, Limin Wang, Zhirong Wu, Dahua Lin
METHOD AND DEVICE FOR VIDEO PROCESSING, ELECTRONIC DEVICE, AND STORAGE MEDIUM

Publication number: 20200394216

Abstract: The present disclosure relates to a method and device for video processing, an electronic device, and a storage medium. The method comprises: determining, on the basis of paragraph information of a query text paragraph and video information of multiple videos in a video library, preselected videos associated with the query text paragraph in the multiple videos; and determining a target video in the preselected videos on the basis of video frame information of the preselected videos and of sentence information of the query text paragraph. The method for video processing of the embodiments of the present disclosure indexes videos by means of the relevance between the videos and the query text paragraph, allows the pinpointing of the target video, avoids search result redundancy, allows the processing of the query text paragraph in a natural language form, and is not limited by the inherent contents of content labels.

Type: Application

Filed: August 6, 2019

Publication date: December 17, 2020

Inventors: Xiaoou TANG, Dian SHAO, Yu XIONG, Yue ZHAO, Qingqiu HUANG, Yu Qiao, Dahua LIN
KEYFRAME SCHEDULING METHOD AND APPARATUS, ELECTRONIC DEVICE, PROGRAM AND MEDIUM

Publication number: 20200394414

Abstract: A key frame scheduling method and apparatus include: performing feature extraction on a current frame through a first network layer of a neural network to obtain low-layer features of the current frame acquiring a scheduling probability of the current frame according to low-level features of a previous key frame adjacent to the current frame and the low-level features of the current frame; determining whether the current frame is scheduled as a key frame according to the scheduling probability value of the current frame; and when determining that the current frame is scheduled as a key frame, performing feature extraction on low-level features of a current key frame via a second network layer of the neural network to obtain high-level features of the current key frame, where the network depth of the first network layer is less than the network depth of the second network layer.

Type: Application

Filed: December 25, 2018

Publication date: December 17, 2020

Applicant: BEIJING SENSETIME TECHNOLOGY DEVELOPMENT CO., LTD.

Inventors: Jianping SHI, Yule LI, Dahua LIN
ANCHOR DETERMINATION METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM

Publication number: 20200250495

Abstract: An anchor determination includes: performing feature extraction on an image to be processed to obtain a first feature map of the image to be processed; performing anchor prediction on the first feature map via an anchor prediction network to obtain position information of anchors and shape information of the anchors in the first feature map, the position information of the anchors referring to information about positions in the first feature map where the anchors are generated. A corresponding anchor determination apparatus and a storage medium are also provided.

Type: Application

Filed: April 21, 2020

Publication date: August 6, 2020

Inventors: Kai CHEN, Jiaqi Wang, Shuo Yang, Chen Change Loy, Dahua Lin
IMAGE TAGGING BASED UPON CROSS DOMAIN CONTEXT

Publication number: 20190362247

Abstract: A method described herein includes receiving a digital image, wherein the digital image includes a first element that corresponds to a first domain and a second element that corresponds to a second domain. The method also includes automatically assigning a label to the first element in the digital image based at least in part upon a computed probability that the label corresponds to the first element, wherein the probability is computed through utilization of a first model that is configured to infer labels for elements in the first domain and a second model that is configured to infer labels for elements in the second domain. The first model receives data that identifies learned relationships between elements in the first domain and elements in the second domain, and the probability is computed by the first model based at least in part upon the learned relationships.

Type: Application

Filed: April 3, 2019

Publication date: November 28, 2019

Inventors: Simon John Baker, Ashish Kapoor, Gang Hua, Dahua Lin

1 2 next