Patents by Inventor Fan Chen
Fan Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20260155158Abstract: Examples are provided relating to system evolving architectures for refining media content editing systems. One aspect includes a method of refining a media content editing architecture, the method comprising: editing a media content using a large language model and a back-end tool service comprising a prompt pool and a plurality of application programming interfaces corresponding to a plurality of editing tools; publishing the edited media content; storing contextual information relating to the editing of the media content; and refining the media content editing architecture using the stored contextual information.Type: ApplicationFiled: January 23, 2026Publication date: June 4, 2026Inventors: Fan Chen, Kin Chung Wong
-
Publication number: 20260112156Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for long video understanding. One of the methods includes obtaining a video; extracting frames from the video; encoding the extracted frames; extracting multimodal features including encoding speech content from the video as a text modality; encoding spatiotemporal dependencies between the extracted frames and aligned multi-modal features; providing the encoded temporal dependencies to a language model (LM) that learns spatiotemporal understanding of the entire video content; and using the language model to respond to user queries about the content of the video.Type: ApplicationFiled: October 22, 2024Publication date: April 23, 2026Inventors: Guang Chen, Dawei Du, Fan Chen, Longyin Wen, Ye Yuan, Wen Zhong, Sijie Zhu, Chia-Wen Kuo, Ziaohui Shen
-
Publication number: 20260093934Abstract: Embodiments of the disclosure relate to a method, an apparatus, a device and a computer readable storage medium for generating data. The method proposed herein includes: obtaining a first feature representation by sampling from a target feature space, the target feature space being determined by processing a set of training samples with an encoding unit; processing the first feature representation with a diffusion unit to determine a second feature representation; and providing a second feature representation to a pre-trained language model to generate a target data sample.Type: ApplicationFiled: September 29, 2025Publication date: April 2, 2026Inventors: Ying Zhou, Xinyao Wang, Yulei Niu, Yaojie Shen, Lexin Tang, Fan Chen, Longyin Wen
-
Publication number: 20260093978Abstract: The present application discloses a model training method, an image editing method, an apparatus, a device, a medium, and a product. The method includes: first acquiring an original image, an editing description text corresponding to the original image, an edited image corresponding to the original image with respect to the editing description text, and evaluation information of the edited image, enabling the evaluation information to describe the state of the edited image in at least one evaluation item. Then the original image, the editing description text and the evaluation information by using an image editing model are processed to obtain an image editing result corresponding to the original image. According to the difference between the image editing result and the edited image, the image editing model is updated.Type: ApplicationFiled: August 29, 2025Publication date: April 2, 2026Inventors: Xin GU, Sijie ZHU, Fan CHEN, Longyin WEN
-
Publication number: 20260094056Abstract: The present disclosure describes techniques for improving efficiency and flexibility of a machine learning model. A machine learning model is configured to decompose self-attention in the machine learning model into a plurality of attention operations. The machine learning model is configured to process information from a plurality of modalities. Concatenated tokens are received by the machine learning model. The concatenated tokens comprise multimodal tokens representative of a content item and textual tokens indicative of a text query. Updated multimodal tokens for a next layer of computation are generated by performing diagonal-attention on the multimodal tokens. Updated textual tokens for the next layer of computation are generated by performing self-attention on the textual tokens and performing cross-attention between the multimodal tokens and the textual tokens.Type: ApplicationFiled: September 30, 2024Publication date: April 2, 2026Inventors: Chia-Wen Kuo, Sijie Zhu, Fan Chen, Longyin Wen
-
Method for improving shunting route setting of centralized traffic control system, device and medium
Patent number: 12589786Abstract: A method is provided for shunting route setting of a centralized traffic control system. The method includes S1, acquiring a shunting operation plan from an interface of a SMIS and generating a shunting route sequence, by a centralized traffic control system; S2, adding an operation direction, a location identifier of a shunter, a current control mode of the shunter and a description of a next route based on an original shunter number of the centralized traffic control system; S3, configuring a manual setting mode and an automatic triggering mode for shunting route setting; S4, adding identifiers of the manual setting mode and the automatic triggering mode to shunter number display; and S5, in the manual setting mode, adding a shunting plan constraint state and a free setting state to a shunting route setting logic of the centralized traffic control system.Type: GrantFiled: November 29, 2022Date of Patent: March 31, 2026Assignee: CASCO SIGNAL LTD.Inventors: Fan Chen, Baowei Tang, Ruyue Wang, Gang Wang, Yuehua Zhai, Zhenguo Feng -
Publication number: 20260065670Abstract: The present disclosure describes techniques for generating video descriptions using a machine learning model. A plurality of sets of visual tokens corresponding to a plurality of frames of a video is generated. A first type of tokens is generated by implementing temporal pooling on the plurality of sets of visual tokens corresponding to the plurality of frames. A second type of tokens is generated by compressing each of the plurality of sets of visual tokens corresponding to each of the plurality of frames. A third type of tokens is generated by applying cross-attention between each of the plurality of sets of visual tokens and a fourth type of tokens including text tokens generated based on an input text query. A text description of the video is generated based on the first type of tokens, the second type of tokens, the third type of tokens, and the fourth type of tokens.Type: ApplicationFiled: September 3, 2024Publication date: March 5, 2026Inventors: Lu Xu, Sijie Zhu, Fan Chen, Longyin Wen
-
Publication number: 20260065036Abstract: Embodiments of the disclosure relate to a method, an apparatus, a device, and a computer-readable storage medium for training a generative model. The method includes: constructing a training prompt; and performing a plurality of rounds of iterative training based on the training prompt, wherein each round of iterative training includes: obtaining a plurality of response contents generated by the generative model based on the training prompt; determining a first response content and a second response content from the plurality of response contents based on evaluation information of the plurality of response contents, wherein an evaluation of the first response content is superior to an evaluation of the second response content; and adjusting a parameter of the generative model to increase a first probability of outputting the first response content and reduce a second probability of outputting the second response content.Type: ApplicationFiled: September 3, 2025Publication date: March 5, 2026Inventors: Yaojie Shen, Xinyao Wang, Yulei Niu, Ying Zhou, Lexin Tang, Fan Chen, Longyin Wen
-
Publication number: 20260065525Abstract: A method, apparatus, device, and computer-readable storage medium for image processing are provided. The method includes receiving a text input for an initial image, the text input describing a visual effect for the initial image. A fusion feature for the text input and the initial image is generated based on the text input and the initial image. A target image corresponding to the initial image is generated based on a first image feature of the initial image and the fusion feature, the target image having a visual element related to the visual effect. The fusion of text and image can better express the desired visual effect.Type: ApplicationFiled: August 20, 2025Publication date: March 5, 2026Inventors: Xin Gu, Sijie Zhu, Fan Chen, Longyin Wen
-
Publication number: 20260065872Abstract: A display device and a driving method thereof are provided. The driving method includes the following steps: simultaneously controlling switches of an M-th row of sub-pixels and a (M+2)-th row of sub-pixels to be turned on or off, and sequentially controlling switches of four adjacent rows of the same color sub-pixels to be turned on or off.Type: ApplicationFiled: October 31, 2024Publication date: March 5, 2026Applicant: TCL CHINA STAR OPTOELECTRONICS TECHNOLOGY CO., LTD.Inventors: Shuming CHANG, Yoongu KIM, Xintian SU, Jiajia CHEN, Xiaoying GUO, Qiqi LIN, Fan CHEN, Lina WANG
-
Patent number: 12548597Abstract: Examples are provided relating to system evolving architectures for refining media content editing systems. One aspect includes a method of refining a media content editing architecture, the method comprising: editing a media content using a large language model and a back-end tool service comprising a prompt pool and a plurality of application programming interfaces corresponding to a plurality of editing tools; publishing the edited media content; storing contextual information relating to the editing of the media content; and refining the media content editing architecture using the stored contextual information.Type: GrantFiled: July 3, 2023Date of Patent: February 10, 2026Assignee: Lemon Inc.Inventors: Fan Chen, Kin Chung Wong
-
Patent number: 12542086Abstract: A display panel and a display device are disclosed. The display panel includes a plurality of scan lines, a plurality of data lines, and a plurality of sub-pixels. Each of the scan lines is electrically connected to a row of the sub-pixels. The plurality of sub-pixels include a first pixel column and a second pixel column. The first pixel column and the second pixel column are arranged adjacent to each other. The first pixel column includes a plurality of first pixel units. The second pixel column includes a plurality of second pixel units. Each of the first pixel unit and the second pixel unit includes three sub-pixels, and colors of the three sub-pixels are different. At least two adjacent first pixel units and at least two adjacent second pixel units are electrically connected to a same data line.Type: GrantFiled: November 30, 2023Date of Patent: February 3, 2026Assignee: TCL CHINA STAR OPTOELECTRONICS TECHNOLOGY CO., LTD.Inventors: Shuming Chang, Yating Wen, Weisheng Zheng, Yuning Zhang, Qian Wang, Shijie Deng, Zhaoming Liang, Fan Chen
-
Patent number: 12530768Abstract: The present disclosure relates to systems and methods for image storage. The methods may include obtaining a first image of a subject. The methods may further include obtaining a second image of the subject. The second image may include scan status information of the subject. The scan status information may be associated with a status of the subject when the first image is acquired. And The methods may also include storing the second image correspondingly with the first image.Type: GrantFiled: March 3, 2023Date of Patent: January 20, 2026Assignee: SHANGHAI UNITED IMAGING HEALTHCARE CO., LTD.Inventors: Yang Bu, Fan Chen
-
Patent number: 12518060Abstract: Examples are provided relating to implementing actions on social media network content based on natural language inputs. One aspect includes a computing system configured to implement a social media network, comprising one or more processors, and a storage device comprising instructions executable to receive a user input including a natural language description of a request for an action on a content item from a dialogue agent configured to engage in dialogue using at least a language model, and generate a prompt for the language model based at least on the user input. The instructions are further executable to input the prompt to the language model to generate output describing operations for implementing the action, call a backend service of the social media network to execute commands to implement the operations, and output a result of executing the commands.Type: GrantFiled: July 3, 2023Date of Patent: January 6, 2026Assignee: Lemon Inc.Inventors: Fan Chen, Kin Chung Wong
-
Publication number: 20250388857Abstract: The present discloses provides the use of tributyrin as an additive for embryonic development culture medium in vitro, in the present disclosure, the tributyrin is applied as an additive to the development culture medium of mouse embryos in vitro for the first time, the tributyrin can significantly increase the rate of blastocyst, and reduce the ROS content in the embryos, improve the mitochondrial membrane potential in the embryos, increase the ATP level and the expression of antioxidant genes in the embryos, improve the DNA methylation and histone modification level in the embryos, promote the embryos development in vitro; In addition, tributyrin, as natural antioxidant and apparent drug, which is safe, non-toxic and side effects; the tributyrin provides strong support for the efficient embryos development of human assisted reproductive technology, mammalian fertilization embryos, parthenogenetic embryos and somatic cell cloned embryos and other embryo engineering technologies in vitro.Type: ApplicationFiled: April 23, 2025Publication date: December 25, 2025Inventors: Fan Chen, Lixiang Zheng, Anfeng Luo, Yanzhen Bi, Hongyan Ren, Hao Gu, Changfan Zhou, Wei Zeng
-
Patent number: 12494016Abstract: The present disclosure is related to systems and methods for subject positioning. The method includes generating a reference subject model of a subject to be scanned by a medical device based on feature information of the subject and a scan protocol of the subject. The method includes generating a target subject model by performing one or more rendering operations on the reference subject model based on at least one rendering parameter associated with an image capturing device. The method includes generating a positioning result by integrating the target subject model with image data related to a scan scene of the subject captured by the image capturing device. The method includes positioning the subject based on the positioning result.Type: GrantFiled: May 17, 2023Date of Patent: December 9, 2025Assignee: SHANGHAI UNITED IMAGING HEALTHCARE CO., LTD.Inventors: Yang Bu, Fan Chen
-
Patent number: 12488034Abstract: The present disclosure describes techniques for searching editing components based on text using a machine learning model. A plurality of visual embeddings indicative of a plurality of visual editing components is acquired by the machine learning model. The plurality of visual embeddings indicative of the plurality of visual editing components is projected into a common space by a first sub-model of the machine learning model. A text query input is received by a user. A text embedding indicative of the text query is generated. The text embedding is projected into the common space by a second sub-model of the machine learning model. At least one visual editing component among the plurality of visual editing components is determined based on the projected text embedding and the plurality of projected visual embeddings in the common space. Information indicative of the at least one visual editing component is displayed via a user interface.Type: GrantFiled: March 7, 2024Date of Patent: December 2, 2025Assignee: Lemon Inc.Inventors: Sijie Zhu, Lu Xu, Fan Chen, Longyin Wen
-
Publication number: 20250342859Abstract: Examples are provided relating to media content editing architectures utilizing machine learning techniques. One aspect includes a method for media content editing, the method comprising: receiving a media content from a user; receiving an editing request for the media content from the user; and editing the media content based on the editing request to generate edited media content by: retrieving a prompt from a prompt pool, wherein the retrieved prompt is selected based on the editing request; parsing the retrieved prompt and the editing request using a large language model to generate one or more editing actions to be performed on the media content; and performing the one or more editing actions on the media content to generate the edited media content.Type: ApplicationFiled: July 10, 2025Publication date: November 6, 2025Inventors: Fan Chen, Kin Chung Wong
-
Publication number: 20250334713Abstract: A stability assessment method of roadway surrounding rock includes: obtaining actual stratum rock parameters; establishing a two-dimensional geological model through numerical simulation based on a drill core columnar diagram and the actual stratum rock parameters; changing influencing factors, and recording amounts and acceleration values of deformation of roadway sidewalls, and whether failure occurs to obtain dynamic response characteristics of roadway surrounding rock, combining the changed influencing factors and the dynamic response characteristics as labels to obtain a dataset, and obtaining multiple datasets including the dataset; dividing the multiple datasets into a training set and a validation set, inputting the training set into a PSO-BP neural network and a GA-SVM deep learning model to obtain a preliminary stability assessment model, and adjusting and validating the preliminary stability assessment model by the validation set to obtain an optimized stability assessment model; and using the optiType: ApplicationFiled: March 18, 2025Publication date: October 30, 2025Inventors: Anye Cao, Geng Li, Chengchun Xue, Yaoqi Liu, Fan Chen, Changbin Wang, Xu Yang, Guowei Lyu, Xianxi Bai, Yujie Peng
-
Publication number: 20250335797Abstract: The present disclosure describes techniques for generating image descriptions using a machine learning model. Mixture of Experts (MoE) blocks are incorporated into a plurality of sub-models of the machine learning model. The first sub-model of the machine learning model comprises at least one first MoE block including a first plurality of experts. A second sub-model of the machine learning model comprises at least one second MoE block including a second plurality of experts. Only a subset of the first plurality of experts is activated to generate visual tokens based on an input image. Only a subset of the second plurality of experts is activated to project the visual tokens into an input space of the third sub-model. A text description of the input image is output by the third sub-model of the machine learning model.Type: ApplicationFiled: September 6, 2024Publication date: October 30, 2025Inventors: Xinyao Wang, Jiachen Li, Sijie Zhu, Chia-Wen Kuo, Lu Xu, Fan Chen, Longyin Wen