Patents by Inventor Zhifei Zhang

Zhifei Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12271983
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that implement related image search and image modification processes using various search engines and a consolidated graphical user interface. For instance, in one or more embodiments, the disclosed systems receive an input digital image and search input and further modify the input digital image using the image search results retrieved in response to the search input. In some cases, the search input includes a multi-modal search input having multiple queries (e.g., an image query and a text query), and the disclosed systems retrieve the image search results utilizing a weighted combination of the queries. In some implementations, the disclosed systems generate an input embedding for the search input (e.g., the multi-modal search input) and retrieve the image search results using the input embedding.
    Type: Grant
    Filed: June 28, 2022
    Date of Patent: April 8, 2025
    Assignee: Adobe Inc.
    Inventors: Zhifei Zhang, Zhe Lin, Scott Cohen, Kevin Gary Smith
  • Publication number: 20250069297
    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for transferring global style features between digital images utilizing one or more machine learning models or neural networks. In particular, in one or more embodiments, the disclosed systems receive a request to transfer a global style from a source digital image to a target digital image, identify at least one target object within the target digital image, and transfer the global style from the source digital image to the target digital image while maintaining an object style of the at least one target object.
    Type: Application
    Filed: November 15, 2024
    Publication date: February 27, 2025
    Inventors: Zhifei Zhang, Zhe Lin, Scott Cohen, Darshan Prasad, Zhihong Ding
  • Patent number: 12235891
    Abstract: Systems, methods, and non-transitory computer-readable media implement related image search and image modification processes using various search engines and a consolidated graphical user interface. For instance, one or more embodiments involve receiving an input digital image and search input and further modifying the input digital image using the image search results retrieved in response to the search input. In some cases, the search input includes a multi-modal search input having multiple queries (e.g., an image query and a text query), and one or more embodiments involve retrieving the image search results utilizing a weighted combination of the queries. Some implementations involve generating an input embedding for the search input (e.g., the multi-modal search input) and retrieving the image search results using the input embedding.
    Type: Grant
    Filed: June 28, 2022
    Date of Patent: February 25, 2025
    Assignee: Adobe Inc.
    Inventors: Zhifei Zhang, Zhe Lin
  • Patent number: 12231872
    Abstract: An audio signal playing method and apparatus, and an electronic device are provided. The method comprises: separating, from a first audio signal, a recorded audio signal corresponding to each of at least one sound source; on the basis of the first audio signal, determining a real-time orientation of each of the at least one sound source relative to the head of a user; for each sound source, according to the real-time orientation of the sound source and the recorded audio signal corresponding to the sound source, generating a target direct audio signal corresponding to the sound source, and generating a target reverberated audio signal corresponding to the sound source; and playing a second audio signal that is generated by means of fusing the target direct audio signal and the target reverberated audio signal corresponding to each sound source.
    Type: Grant
    Filed: February 28, 2024
    Date of Patent: February 18, 2025
    Assignee: Beijing Youzhuju Network Technology Co., Ltd.
    Inventors: Zheng Xue, Yangfei Xu, Wenzhi Fan, Zhifei Zhang, Yuzhou Gong, Zejun Ma
  • Publication number: 20250046055
    Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that train (and utilize) an image color editing diffusion neural network to generate one or more color-edited digital images for a digital image. In particular, in one or more implementations, the disclosed systems identify a digital image depicting content in a first color style. Moreover, the disclosed systems generate, from the digital image utilizing an image color editing diffusion neural network, a color-edited digital image depicting the content in a second color style different from the first color style. Further, the disclosed systems provide, for display within a graphical user interface, the color-edited digital image.
    Type: Application
    Filed: August 2, 2023
    Publication date: February 6, 2025
    Inventors: Zhifei Zhang, Zhe Lin, Yixuan Ren, Yifei Fan, Jing Shi
  • Patent number: 12217395
    Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure encode a content image and a style image using a machine learning model to obtain content features and style features, wherein the content image includes a first object having a first appearance attribute and the style image includes a second object having a second appearance attribute; align the content features and the style features to obtain a sparse correspondence map that indicates a correspondence between a sparse set of pixels of the content image and corresponding pixels of the style image; and generate a hybrid image based on the sparse correspondence map, wherein the hybrid image depicts the first object having the second appearance attribute.
    Type: Grant
    Filed: April 27, 2022
    Date of Patent: February 4, 2025
    Assignee: Adobe Inc.
    Inventors: Sangryul Jeon, Zhifei Zhang, Zhe Lin, Scott Cohen, Zhihong Ding
  • Publication number: 20250022099
    Abstract: Systems and methods for image compositing are provided. An aspect of the systems and methods includes obtaining a first image and a second image, wherein the first image includes a target location and the second image includes a target element; encoding the second image using an image encoder to obtain an image embedding; generating a descriptive embedding based on the image embedding using an adapter network; and generating a composite image based on the descriptive embedding and the first image using an image generation model, wherein the composite image depicts the target element from the second image at the target location of the first image.
    Type: Application
    Filed: July 13, 2023
    Publication date: January 16, 2025
    Inventors: Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Lynn Price, Jianming Zhang, Soo Ye Kim
  • Publication number: 20240404013
    Abstract: Embodiments include systems and methods for generative image filling based on text and a reference image. In one aspect, the system obtains an input image, a reference image, and a text prompt. Then, the system encodes the reference image to obtain an image embedding and encodes the text prompt to obtain a text embedding. Subsequently, a composite image is generated based on the input image, the image embedding, and the text embedding.
    Type: Application
    Filed: November 21, 2023
    Publication date: December 5, 2024
    Inventors: Yuqian Zhou, Krishna Kumar Singh, Zhe Lin, Qing Liu, Zhifei Zhang, Sohrab Amirghodsi, Elya Shechtman, Jingwan Lu
  • Patent number: 12154196
    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for transferring global style features between digital images utilizing one or more machine learning models or neural networks. In particular, in one or more embodiments, the disclosed systems receive a request to transfer a global style from a source digital image to a target digital image, identify at least one target object within the target digital image, and transfer the global style from the source digital image to the target digital image while maintaining an object style of the at least one target object.
    Type: Grant
    Filed: July 1, 2022
    Date of Patent: November 26, 2024
    Assignee: Adobe Inc.
    Inventors: Zhifei Zhang, Zhe Lin, Scott Cohen, Darshan Prasad, Zhihong Ding
  • Publication number: 20240338869
    Abstract: An image processing system obtains an input image (e.g., a user provided image, etc.) and a mask indicating an edit region of the image. A user selects an image editing mode for an image generation network from a plurality of image editing modes. The image generation network generates an output image using the input image, the mask, and the image editing mode.
    Type: Application
    Filed: September 26, 2023
    Publication date: October 10, 2024
    Inventors: Yuqian Zhou, Krishna Kumar Singh, Zhifei Zhang, Difan Liu, Zhe Lin, Jianming Zhang, Qing Liu, Jingwan Lu, Elya Shechtman, Sohrab Amirghodsi, Connelly Stuart Barnes
  • Publication number: 20240265505
    Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a noise image and guidance information for generating an image. A diffusion model generates an intermediate noise prediction for the image based on the noise image. A conditioning network generates noise modulation parameters. The intermediate noise prediction and the noise modulation parameters are combined to obtain a modified intermediate noise prediction. The diffusion model generates the image based on the modified intermediate noise prediction, wherein the image depicts a scene based on the guidance information.
    Type: Application
    Filed: February 6, 2023
    Publication date: August 8, 2024
    Inventors: Cusuh Ham, Tobias Hinz, Jingwan Lu, Krishna Kumar Singh, Zhifei Zhang
  • Publication number: 20240205634
    Abstract: An audio signal playing method and apparatus, and an electronic device are provided. The method comprises: separating, from a first audio signal, a recorded audio signal corresponding to each of at least one sound source; on the basis of the first audio signal, determining a real-time orientation of each of the at least one sound source relative to the head of a user; for each sound source, according to the real-time orientation of the sound source and the recorded audio signal corresponding to the sound source, generating a target direct audio signal corresponding to the sound source, and generating a target reverberated audio signal corresponding to the sound source; and playing a second audio signal that is generated by means of fusing the target direct audio signal and the target reverberated audio signal corresponding to each sound source.
    Type: Application
    Filed: February 28, 2024
    Publication date: June 20, 2024
    Inventors: Zheng Xue, Yangfei Xu, Wenzhi Fan, Zhifei Zhang, Yuzhou Gong, Zejun Ma
  • Publication number: 20240169622
    Abstract: Systems and methods for multi-modal image editing are provided. In one aspect, a system and method for multi-modal image editing includes identifying an image, a prompt identifying an element to be added to the image, and a mask indicating a first region of the image for depicting the element. The system then generates a partially noisy image map that includes noise in the first region and image features from the image in a second region outside the first region. A diffusion model generates a composite image map based on the partially noisy image map and the prompt. In some cases, the composite image map includes the target element in the first region that corresponds to the mask.
    Type: Application
    Filed: November 22, 2022
    Publication date: May 23, 2024
    Inventors: Shaoan Xie, Zhifei Zhang, Zhe Lin, Tobias Hinz
  • Patent number: 11977829
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and flexibly generating scalable and semantically editable font representations utilizing a machine learning approach. For example, the disclosed systems generate a font representation code from a glyph utilizing a particular neural network architecture. For example, the disclosed systems utilize a glyph appearance propagation model and perform an iterative process to generate a font representation code from an initial glyph. Additionally, using a glyph appearance propagation model, the disclosed systems automatically propagate the appearance of the initial glyph from the font representation code to generate additional glyphs corresponding to respective glyph labels. In some embodiments, the disclosed systems propagate edits or other changes in appearance of a glyph to other glyphs within a glyph set (e.g., to match the appearance of the edited glyph).
    Type: Grant
    Filed: June 29, 2021
    Date of Patent: May 7, 2024
    Assignee: Adobe Inc.
    Inventors: Zhifei Zhang, Zhaowen Wang, Hailin Jin, Matthew Fisher
  • Publication number: 20240135514
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via multi-layered scene completion techniques facilitated by artificial intelligence. For instance, in some embodiments, the disclosed systems receive a digital image portraying a first object and a second object against a background, where the first object occludes a portion of the second object. Additionally, the disclosed systems pre-process the digital image to generate a first content fill for the portion of the second object occluded by the first object and a second content fill for a portion of the background occluded by the second object. After pre-processing, the disclosed systems detect one or more user interactions to move or delete the first object from the digital image. The disclosed systems further modify the digital image by moving or deleting the first object and exposing the first content fill for the portion of the second object.
    Type: Application
    Filed: September 1, 2023
    Publication date: April 25, 2024
    Inventors: Daniil Pakhomov, Qing Liu, Zhihong Ding, Scott Cohen, Zhe Lin, Jianming Zhang, Zhifei Zhang, Ohiremen Dibua, Mariette Souppe, Krishna Kumar Singh, Jonathan Brandt
  • Patent number: 11938283
    Abstract: A bendable sheath and a delivery system using the bendable sheath. The bendable sheath comprises a tube body (3). The tube body (3) comprises a distal end and a proximal end. A tube wall of the tube body (3) is connected to a pull wire (8). One end of the pull wire (8) extends towards the proximal end of the tube body (3), and the other end is connected to the tube body (3) near the distal end of the tube body (3). The pull wire (8) comprises at least a section thereof disposed freely outside the tube body (3) and near the distal end of the tube body (3). The pull wire (8) in the bendable sheath comprises the section disposed freely outside the sheath tube body (3) and, when pulled, the section is disposed so as to facilitate the application of force. The section moves relative to the tube body (3), such that a force application point is adaptively changed.
    Type: Grant
    Filed: September 2, 2020
    Date of Patent: March 26, 2024
    Assignee: VENUS MEDTECH (HANGZHOU), INC.
    Inventors: Mao Chen, Yuan Feng, Zhifei Zhang, Feng Guo, Quangang Gong, Shiguang Wu
  • Publication number: 20240088656
    Abstract: Provided are an off-grid start method and system for a new energy power generation system. The method includes: gradually boosting the voltage of a master according to a plurality of preset voltage given values, and slaves determining, by means of measuring a voltage of the load of a system, a target voltage given value used by the master; and the master determining, by means of monitoring an output current of the master itself, that a slave is successfully connected in parallel, and then continuing to boost the output voltage until all the slaves run in parallel. Therefore, according to the solution, no upper-layer synchronous control is required during a black-start process, and no communication between a master and slaves is required.
    Type: Application
    Filed: November 8, 2021
    Publication date: March 14, 2024
    Applicant: Sungrow Power Supply Co., Ltd.
    Inventors: Xing Li, Houlai Geng, Qun Zheng, Menglin Cao, Zhifei Zhang
  • Patent number: 11925557
    Abstract: A sheath facilitating retraction of a prosthetic implant is provided. The sheath includes a tubular body, and the distal end of the body is connected with an expandable section. The expandable section has relative converged and flared configurations and includes a primary expandable area and a secondary expandable area. The primary expandable area includes a plurality of first expandable pieces arranged at intervals in a circumferential direction of the body. The secondary expandable area includes a plurality of second expandable pieces arranged at intervals in the circumferential direction of the body, and all the second expandable pieces include two alternating groups consisting of a first group formed by further extending the first expandable pieces to the distal end and a second group formed by connecting strips wound between two adjacent first expandable pieces.
    Type: Grant
    Filed: June 29, 2023
    Date of Patent: March 12, 2024
    Assignee: VENUS MEDTECH (HANGZHOU) INC.
    Inventors: Zhifei Zhang, Jianan Wang, Meirong Liu
  • Publication number: 20240038252
    Abstract: A sound signal processing method, an electronic device, and a computer-readable medium are provided. The method includes: importing first frequency spectrum data corresponding to first audio data into a pre-trained sound processing model to obtain a processing result; and generating, based on the processing result, pure audio data corresponding to the first audio data. The sound processing model includes at least one preset convolution layer, and operations performed by using the preset convolution layer include: performing, based on a first convolution kernel group, a convolution operation on a first sound spectrum feature map inputted into the preset convolution layer, to obtain a second sound spectrum feature map; and combining, based on a second convolution kernel group, the obtained second sound spectrum feature map, to obtain a third sound spectrum feature map corresponding to the second convolution kernel group.
    Type: Application
    Filed: December 3, 2021
    Publication date: February 1, 2024
    Inventors: Wenzhi Fan, Fanliu Kong, Yangfei Xu, Zhifei Zhang
  • Patent number: 11875435
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media for accurately and flexibly generating scalable fonts utilizing multi-implicit neural font representations. For instance, the disclosed systems combine deep learning with differentiable rasterization to generate a multi-implicit neural font representation of a glyph. For example, the disclosed systems utilize an implicit differentiable font neural network to determine a font style code for an input glyph as well as distance values for locations of the glyph to be rendered based on a glyph label and the font style code. Further, the disclosed systems rasterize the distance values utilizing a differentiable rasterization model and combine the rasterized distance values to generate a permutation-invariant version of the glyph corresponding glyph set.
    Type: Grant
    Filed: October 12, 2021
    Date of Patent: January 16, 2024
    Assignee: Adobe Inc.
    Inventors: Chinthala Pradyumna Reddy, Zhifei Zhang, Matthew Fisher, Hailin Jin, Zhaowen Wang, Niloy J Mitra
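
The related image search entries above (patent numbers 12271983 and 12235891) describe retrieving results using a weighted combination of an image query and a text query. The following is a minimal sketch of that general idea, not the patented method: it assumes both queries have already been embedded into a shared vector space and that results are ranked by cosine similarity. The function names, the image_weight parameter, and the toy index are hypothetical and chosen only for illustration.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def combine_queries(image_emb, text_emb, image_weight=0.5):
    """Blend two query embeddings into one input embedding.

    image_weight is an assumed knob in [0, 1]; the text query
    receives the remaining weight (1 - image_weight).
    """
    if not 0.0 <= image_weight <= 1.0:
        raise ValueError("image_weight must be between 0 and 1")
    if len(image_emb) != len(text_emb):
        raise ValueError("embeddings must have the same dimension")
    img = normalize(image_emb)
    txt = normalize(text_emb)
    mixed = [image_weight * a + (1.0 - image_weight) * b
             for a, b in zip(img, txt)]
    return normalize(mixed)


def search(query_emb, index, top_k=3):
    """Rank indexed items by cosine similarity to the query embedding."""
    scored = []
    for name, vec in index.items():
        unit = normalize(vec)
        score = sum(q * c for q, c in zip(query_emb, unit))
        scored.append((score, name))
    scored.sort(reverse=True)
    return [name for _, name in scored[:top_k]]


# Toy two-dimensional index; real embeddings would come from a trained model.
index = {
    "red_car": [1.0, 0.0],
    "blue_sky": [0.0, 1.0],
    "sunset": [1.0, 1.0],
}
image_query = [1.0, 0.0]  # stands in for an embedded image query
text_query = [0.0, 1.0]   # stands in for an embedded text query

image_heavy = search(combine_queries(image_query, text_query, 0.9), index)
text_heavy = search(combine_queries(image_query, text_query, 0.1), index)
balanced = search(combine_queries(image_query, text_query, 0.5), index)
```

Shifting image_weight toward 1 pulls the ranking toward items that resemble the image query, and shifting it toward 0 favors the text query; an even split favors items close to both.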