Patents by Inventor Ning Yu
Ning Yu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20260134527Abstract: The disclosed computer-implemented method may include receiving, by a computing device, an original video to relight. Additionally, the method may include predicting, by the computing device using a de-lighting model trained on a hybrid dataset of lighting-rich data and motion-rich data, an albedo video corresponding to the original video. The method may also include generating, by the computing device using a relighting model trained on the hybrid dataset, a relit video based on the albedo video under a specified lighting condition based on an input high dynamic range (HDR) map. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: September 22, 2025Publication date: May 14, 2026Inventors: Yiqun Mei, Mingming He, Li Ma, Julien Olivier Victor Philip, Wenqi Xian, David M. George, Xueming Yu, Gabriel Dedic, Ahmet Levent Tasel, Ning Yu, Paul E. Debevec
-
Publication number: 20260134518Abstract: Methods for motion-controllable video diffusion include extracting optical flow fields from an input video and computing warped noise by iteratively warping noise between consecutive frames using the optical flow fields. The iteratively warping includes (i) re-Gaussianizing expanded pixel regions by sampling fresh Gaussian noise, and (ii) aggregating contracted pixel regions by merging noise particles and renormalizing variance to preserve spatial Gaussianity. An output video is generated by initializing a diffusion process with the warped noise and iteratively denoising to produce temporally coherent output frames. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: November 13, 2025Publication date: May 14, 2026Inventors: Ryan Burgert, Yuancheng Xu, Wenqi Xian, Oliver Pilarski, Pascal Clausen, Mingming He, Li Ma, Yitong Deng, Lingxiao Li, Mohsen Mousavi, Paul Debevec, Ning Yu
-
Patent number: 12585919Abstract: Embodiments described herein provide a mechanism for replacing existing text encoders in text-to-image generation models with more powerful pre-trained language models. Specifically, a translation network is trained to map features from the pre-trained language model output into the space of the target text encoder. The training preserves the rich structure of the pre-trained language model while allowing it to operate within the text-to-image generation model. The resulting modularized text-to-image model receives prompt and generates an image representing the features contained in the prompt.Type: GrantFiled: January 31, 2023Date of Patent: March 24, 2026Assignee: Salesforce, Inc.Inventors: Ning Yu, Can Qin, Chen Xing, Shu Zhang, Stefano Ermon, Caiming Xiong, Ran Xu
-
Patent number: 12536713Abstract: Embodiments described herein provide a method of image generation. The method includes a fixed diffusion model, and a trainable diffusion model. The fixed diffusion model may be pretrained on a large training corpus. The trainable diffusion model may be used to control the image generation of the fixed diffusion model by modifying internal representations of the fixed diffusion model. A task instruction may be provided in addition to a text prompt, and the task instruction may guide the trainable diffusion model together with the visual conditions. The visual conditions may be adapted according to the task instruction. During training, a fixed number of task instructions may be used. At inference, unseen task instructions may be used by combining convolutional kernels of the visual condition adapter.Type: GrantFiled: September 29, 2023Date of Patent: January 27, 2026Assignee: Salesforce, Inc.Inventors: Ning Yu, Can Qin, Shu Zhang, Yihao Feng, Xinyi Yang, Ran Xu
-
Patent number: 12536720Abstract: Embodiments described herein provide systems and methods for multimodal layout generations for digital publications. The system may receive as inputs, a background image, one or more foreground texts, and one or more foreground images. Feature representations of the background image may be generated. The foreground inputs may be input to a layout generator which has cross attention to the background image feature representations in order to generate a layout comprising of bounding box parameters for each input item. A composite layout may be generated based on the inputs and generated bounding boxes. The resulting composite layout may then be displayed on a user interface.Type: GrantFiled: January 30, 2023Date of Patent: January 27, 2026Assignee: Salesforce, Inc.Inventors: Ning Yu, Chia-Chih Chen, Zeyuan Chen, Caiming Xiong, Juan Carlos Niebles Duque, Ran Xu, Rui Meng
-
Publication number: 20260011074Abstract: The disclosed computer-implemented method may include receiving, by a computing device, multi-view flat-lit performance data of a subject. Additionally, the method may include rendering, by the computing device, a dynamic sequence of novel-view flat-lit images of the subject based on a deformable three-dimensional Gaussian splatting (3DGS) model. The method may also include providing the rendered dynamic sequence of flat-lit images as input to a diffusion-based relighting model trained on the multi-view flat-lit performance data of the subject. Furthermore, the method may include generating, by the computing device using the diffusion-based relighting model, a relit sequence of the subject under a specified lighting condition. Various other methods, systems, and computer-readable media are also disclosed.Type: ApplicationFiled: July 2, 2025Publication date: January 8, 2026Inventors: Mingming He, Pascal Clausen, Ahmet Tasel, Li Ma, Oliver Pilarski, Wenqi Xian, Laszlo Rikker, Xueming Yu, Ryan Burgert, Ning Yu, Paul E. Debevec
-
Publication number: 20260000322Abstract: This disclosure is directed to systems and techniques for detecting change in patient health based upon patient data. In one example, a medical system comprising processing circuitry communicably coupled to a glucose sensor and configured to generate continuous glucose sensor measurements of a patient. The processing circuitry is further configured to: extract at least one feature from the continuous glucose sensor measurements over at least one time period, wherein the at least one feature comprises one or more of an amount of time within a pre-determined glucose level range, a number of hypoglycemia events, a number of hyperglycemia events, or one or more statistical metrics corresponding to the continuous glucose sensor measurements; apply a machine learning model to the at least one extracted feature to produce data indicative of a risk of a cardiovascular event; and generate output data based on the risk of the cardiovascular event.Type: ApplicationFiled: September 4, 2025Publication date: January 1, 2026Inventors: Kamal Deep Mothilal, Michael D. Eggen, Ning Yu, John P. Keane, Shantanu Sarkar, Randal C. Schulhauser, David L. Probst, Mark R. Boone, Kenneth A. Timmerman, Stanley J. Taraszewski, Matthew A. Joyce, Amruta Paritosh Dixit, Kathryn Hilpisch, Kathryn Ann Milbrandt, Laura M. Zimmerman, Matthew L. Plante
-
Patent number: 12494004Abstract: Embodiments described herein provide a feedback based instructional image editing framework that employs a diffusion process to follow user instruction for image editing. A diffusion model is fine-tuned using a reward model, which may be trained via human annotation. The training of the reward model may be done by having the image editing model output a number of images, which a human annotator ranks based on their alignment with the original image and a given instruction.Type: GrantFiled: July 12, 2023Date of Patent: December 9, 2025Assignee: Salesforce, Inc.Inventors: Shu Zhang, Xinyi Yang, Yihao Feng, Ran Xu, Ning Yu, Chia-Chih Chen
-
Patent number: 12493795Abstract: Embodiments described herein provide systems and methods for training a text retrieval model. A system may generate queries associated with provided documents. The queries may be generated in one or more different manners. Examples of query generation may include extracting relevant spans of text from the documents, prompting a language model for a topic, title, abstractive summary, and/or extractive summary based on the documents. Metadata such as title or other HTML tags may be used as queries. Using the one or more queries, the text retrieval model may be trained using contrastive learning, using the generated query, and positive and negative sample documents. A fine-tuning training phase may be performed using domain-specific data which may also be done with generated query pairs, or may be done in a supervised fashion with provided queries. The text retrieval model may be used to locate documents given an input query.Type: GrantFiled: April 19, 2023Date of Patent: December 9, 2025Assignee: Salesforce, Inc.Inventors: Rui Meng, Yingbo Zhou, Ye Liu, Semih Yavuz, Ning Yu
-
Patent number: 12430849Abstract: A method of training a neural network based three-dimensional (3D) encoder is provided. A first plurality of samples of a training dataset are generated using a first 3D model. An image generator with multi-view rendering is used to generate a plurality of two-dimensional (2D) images having different viewpoints of the first 3D model. A first language model is used to generate a plurality of texts corresponding to the plurality of 2D images respectively. A first text for a first image is generated by using one or more text descriptions generated by the first language model. A point cloud is generated by randomly sampling points in the 3D model. The first plurality of samples are generated using the plurality of 2D images, the corresponding plurality of texts, and the point cloud. The neural network based 3D encoder is trained using the training dataset including the first plurality of samples.Type: GrantFiled: October 24, 2023Date of Patent: September 30, 2025Assignee: Salesforce, Inc.Inventors: Le Xue, Ning Yu, Shu Zhang, Junnan Li, Caiming Xiong, Silvio Savarese, Juan Carlos Niebles Duque, Ran Xu
-
Patent number: 12426811Abstract: This disclosure is directed to systems and techniques for detecting change in patient health based upon patient data. In one example, a medical system comprising processing circuitry communicably coupled to a glucose sensor and configured to generate continuous glucose sensor measurements of a patient. The processing circuitry is further configured to: extract at least one feature from the continuous glucose sensor measurements over at least one time period, wherein the at least one feature comprises one or more of an amount of time within a pre-determined glucose level range, a number of hypoglycemia events, a number of hyperglycemia events, or one or more statistical metrics corresponding to the continuous glucose sensor measurements; apply a machine learning model to the at least one extracted feature to produce data indicative of a risk of a cardiovascular event; and generate output data based on the risk of the cardiovascular event.Type: GrantFiled: May 16, 2022Date of Patent: September 30, 2025Assignee: Medtronic, Inc.Inventors: Kamal Deep Mothilal, Michael D. Eggen, Ning Yu, John P Keane, Shantanu Sarkar, Randal C. Schulhauser, David L. Probst, Mark R. Boone, Kenneth A Timmerman, Stanley J Taraszewski, Matthew A Joyce, Amruta Paritosh Dixit, Kathryn E. Hilpisch, Kathryn Ann Milbrandt, Laura M Zimmerman, Matthew L Plante
-
Publication number: 20250297297Abstract: A system includes a chip-receiving component, a first fluid processing assembly, a second fluid processing assembly, and a fluid communication pathway. The chip-receiving component is to receive a process chip having microfluidic passageways. The first fluid processing assembly is to communicate fluids to microfluidic passageways of a process chip received by the chip-receiving component. The second fluid processing assembly includes a sample support feature to support sample containers. The second fluid processing assembly also includes a plurality of sampling heads to selectively communicate fluids from sample containers supported by the sample support feature. The fluid communication pathway includes a plurality of conduits to provide fluid communication between the first fluid processing assembly and the plurality of sampling heads.Type: ApplicationFiled: June 4, 2025Publication date: September 25, 2025Inventors: Eric Chu, Tamas Czimmermann, Benjamin Eldridge, Kenneth Jordan, Ximiao Wen, Ning Yu
-
Patent number: 12394220Abstract: Embodiments described herein provide a system for three-dimensional (3D) object detection. The system includes an input interface configured to obtain 3D point data describing spatial information of a plurality of points, and a memory storing a neural network based 3D object detection model having an encoder and a decoder. The system also includes processors to perform operations including: encoding, by the encoder, a first set of coordinates into a first set of point features and a set of object features; sampling a second set of point features from the first set of point features; generating, by attention layers at the decoder, a set of attention weights by applying cross-attention over at least the set of object features and the second set of point feature, and generate, by the decoder, a predicted bounding box among the plurality of points based on at least in part on the set of attention weights.Type: GrantFiled: January 30, 2023Date of Patent: August 19, 2025Assignee: Salesforce, Inc.Inventors: Manli Shu, Le Xue, Ning Yu, Roberto Martín-Martín, Juan Carlos Niebles Duque, Caiming Xiong, Ran Xu
-
Patent number: 12387340Abstract: Embodiments described herein provide an open-vocabulary instance segmentation framework that adopts a pre-trained vision-language model to develop a pipeline in detecting novel categories of instances.Type: GrantFiled: January 25, 2023Date of Patent: August 12, 2025Assignee: Salesforce, Inc.Inventors: Ning Yu, Vibashan Vishnukumar Sharmini, Chen Xing, Juan Carlos Niebles Duque, Ran Xu
-
Publication number: 20250075361Abstract: The present invention relates to a process for manufacturing a silver-plated copper or a copper alloy blank wire, having a silver layer thickness of 1.5 ?m to 15 ?m, comprising the step of electrolytically depositing silver on the copper or copper alloy blank wire, said electrolytic deposition being performed at a pulsating current with reversal in a silver-plating bath under particular electrolytic conditions. It also relates to the silver-plated copper or copper alloy blank wire obtainable by said process, a process for manufacturing a silver-plated copper or copper alloy strand, the strand obtainable by said process, a silver-plated conductor and an electromagnetic shielding layer comprising the silver-plated strand, an electric wire comprising the silver-plated conductor and an electric cable comprising the electric wire and uses thereof.Type: ApplicationFiled: April 11, 2022Publication date: March 6, 2025Inventors: Ning YU, Aurélie M. DE JESUS
-
Publication number: 20250068901Abstract: Embodiments described herein provide a diffusion-based framework that is trained on a dataset with limited text labels, to generate a distribution of data samples in the dataset given a specific text description label. Specifically, firstly, unlabeled data is used to train the diffusion model to generate a data distribution of data samples given a specific text description label. Then text-labeled data samples are used to finetune the diffusion model to generate data distribution given a specific text description label, thus enhancing controllability of training.Type: ApplicationFiled: January 25, 2024Publication date: February 27, 2025Inventors: Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese
-
Patent number: 12236690Abstract: The present disclosure provides a vehicle control method, an apparatus, an electronic device and a vehicle, which relates to a technical field of artificial intelligence, such as automatic driving, intelligent transportation and computer vision, etc. The specific implementation solution is that: when passing a traffic light intersection, a vehicle can acquire a traffic light image of the intersection at a driving direction of the vehicle, identify the traffic light image, determine a first light color of a target traffic light corresponding to a lane on which the vehicle is located; and acquire light color change information of the target traffic light latest recorded; determine remained duration of the first light color according to the light color change information, then determine a driving strategy at the intersection of the vehicle according to the first light color and the remained duration of the first light color.Type: GrantFiled: January 10, 2022Date of Patent: February 25, 2025Assignee: Apollo Intelligent Connectivity (Beijing) Technology Co., Ltd.Inventors: Xingyu Wang, Dengxiang Zhuang, Ning Yu
-
Publication number: 20240386623Abstract: Embodiments described herein provide a method of image generation. The method includes a fixed diffusion model, and a trainable diffusion model. The fixed diffusion model may be pretrained on a large training corpus. The trainable diffusion model may be used to control the image generation of the fixed diffusion model by modifying internal representations of the fixed diffusion model. A task instruction may be provided in addition to a text prompt, and the task instruction may guide the trainable diffusion model together with the visual conditions. The visual conditions may be adapted according to the task instruction. During training, a fixed number of task instructions may be used. At inference, unseen task instructions may be used by combining convolutional kernels of the visual condition adapter.Type: ApplicationFiled: September 29, 2023Publication date: November 21, 2024Inventors: Ning YU, Can QIN, Shu ZHANG, Yihao FENG, Xinyi YANG, Ran XU
-
Publication number: 20240370718Abstract: Embodiments described herein provide a method of generating a multi-modal task output to a text instruction relating to inputs of multiple different modalities (e.g., text, audio, video, 3D). The method comprises receiving, via a data interface, a first input of a first modality, a second input of a second modality and the text instruction relating to the first and the second inputs; encoding, by a first multimodal encoder adapted for the first modality, the first input of the first modality into a first encoded representation conditioned on the text instruction; encoding, by a second multimodal encoder adapted for the second modality, the second input of the second modality into a second encoded representation conditioned on the text instruction; and generating, by a neural network based language model, the multi-modal task output based on an input combining the first encoded representation, the second encoded representation, and the text instruction.Type: ApplicationFiled: December 29, 2023Publication date: November 7, 2024Inventors: Artemis Panagopoulou, Le Xue, Ning Yu, Junnan Li, Dongxu Li, Silvio Savarese, Shafiq Rayhan Joty, Ran Xu, Caiming Xiong, Juan Carlos Niebles Duque
-
Patent number: 12103516Abstract: In response to a request to park an ADV in a parking lot, a set of parking-trajectories associated with a set of predetermined locations near one or more parking spots in the parking lot may be obtained, where the set of parking-trajectories was previously generated based on prior collected planning and control data of the parking lot. Each parking-trajectory of the set of parking-trajectories may correspond to one parking spot of the one or more parking spots in the parking lot. A parking-trajectory may be selected from the set of parking-trajectories based on a current location of the ADV. The ADV may be controlled to park in a corresponding parking spot according to the selected parking-trajectory.Type: GrantFiled: March 31, 2020Date of Patent: October 1, 2024Assignees: BAIDU USA LLC, Baidu.com Times Technology (Beijing) Co., Ltd.Inventors: Xin Xu, Fan Zhu, Dongchun Yao, Ning Yu