Patents by Inventor Ann Lee

Ann Lee has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20260100204
    Abstract: A method to generate synchronized audio for a video includes receiving the video including a sequence of frames and receiving a text input describing at least one of a scene, an event, or a mood to be reflected in an audio track. The method also includes generating a latent audio representation via an audio generation model conditioned jointly on video embeddings associated with the sequence of frames and text embeddings associated with the text input. The method also includes decoding the latent audio representation to produce an audio track temporally aligned with the video and semantically consistent with the text input.
    Type: Application
    Filed: October 2, 2025
    Publication date: April 9, 2026
    Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
  • Publication number: 20260100203
    Abstract: A method to edit a video includes receiving an input video including a sequence of frames and receiving an editing instruction expressed in natural language. The method also includes generating a multimodal condition based on the textual editing instruction and the input video. The multimodal condition may include an embedding of the input video concatenated with an embedding of the textual editing instruction. The method also includes applying, via a video editing model, the multimodal condition to modify visual content of the input video. The method further includes generating an edited video including visual modifications corresponding to the textual editing instruction. The edited video preserves temporal coherence and overall visual fidelity of the input video.
    Type: Application
    Filed: October 2, 2025
    Publication date: April 9, 2026
    Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
  • Publication number: 20260101081
    Abstract: A system and method to generate a video is provided. The method may include generating, based on a user input including a description of a desired video, a structured script including one or more of scene descriptions, dialogue, or explicit shot-level information. The method also includes generating, based on the structured script, a sequence of video frames representing one or more scenes. The method further includes generating, based on the structured script and the sequence of video frames, an audio track including one or more of ambient sounds, sound effects, or music. The generated audio track being temporally synchronized with the sequence of video frames. The method also includes combining the sequence of video frames with the audio track to generate a synchronized video output representing the desired video.
    Type: Application
    Filed: October 2, 2025
    Publication date: April 9, 2026
    Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
  • Publication number: 20260099978
    Abstract: A method to generate a video includes receiving an input describing a scene. The method also includes receiving a reference image depicting a character. The method further includes generating, via an encoder, embeddings of identity features of the reference image. The method also includes generating, via a video generation model, the video in which the character appears with consistent likeness across multiple frames in accordance with the embeddings and the text prompt.
    Type: Application
    Filed: October 2, 2025
    Publication date: April 9, 2026
    Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
  • Publication number: 20230186035
    Abstract: In one embodiment, a method includes accessing a first utterance of a content by a first speaker, generating first discrete speech units from the first utterance based on a speech-learning model, wherein each of the first discrete speech units is associated with a speech cluster, accessing second utterances of the content by second speakers different from the first speaker, and training a speech normalizer by processing each of the second utterances using the speech normalizer to generate second discrete speech units and updating the speech normalizer by using the first discrete speech units as an optimization target for the second discrete speech units associated with each of the second utterances.
    Type: Application
    Filed: August 16, 2022
    Publication date: June 15, 2023
    Inventors: Ann Lee, Peng-Jen Chen, Holger Schwenk, Jiatao Gu, Wei-Ning Hsu
  • Patent number: 10701120
    Abstract: Systems and methods for sharing protected media content are provided. Protected media content can be shared when at least a first and second user are proximately located. The first and second user can be bound or paired based on one or more identification indicia associated with first and second user devices utilized, owned, or operated by the first and second users, respectively. Upon pairing, media content from the first and second users' media content libraries can be shared. Additionally, proximate location can be leveraged to surface media content to other users, giving such other users the opportunity to discover new media content, and otherwise engage in transactions involving the new media content. Further still, the most popular media content associated with the second user or group of users proximate to the first user can be determined and used to prompt further interaction or display information regarding such popular media content.
    Type: Grant
    Filed: June 19, 2015
    Date of Patent: June 30, 2020
    Assignee: Disney Enterprises, Inc.
    Inventors: Christopher S. Taylor, Mark Arana, Josiah Eatedali, Edward Drake, Ann Lee, Anthony Mutalipassi
  • Patent number: 10346034
    Abstract: A method for dynamically generating a personalized handwriting character font includes inputting a plurality of handwriting sequentially through an input interface. Each handwriting describes a character. Then, the positions of strokes of characters in the input interface described by the plurality of handwriting are identified. Next, font characteristics of the characters are determined according to the positions of strokes in the input interface. A personalized handwriting character font characteristic is determined according to the font characteristics. Finally, a new character font file with a personalized handwriting character font is generated according to the personalized handwriting character font characteristic.
    Type: Grant
    Filed: September 13, 2016
    Date of Patent: July 9, 2019
    Assignee: DynaComware Taiwan Inc.
    Inventors: Fu-Jen Wang, Ji-Ming Chen, Ann Lee
  • Publication number: 20170109034
    Abstract: A method for dynamically generating a personalized handwriting character font includes inputting a plurality of handwriting sequentially through an input interface. Each handwriting describes a character. Then, the positions of strokes of characters in the input interface described by the plurality of handwriting are identified. Next, font characteristics of the characters are determined according to the positions of strokes in the input interface. A personalized handwriting character font characteristic is determined according to the font characteristics. Finally, a new character font file with a personalized handwriting character font is generated according to the personalized handwriting character font characteristic.
    Type: Application
    Filed: September 13, 2016
    Publication date: April 20, 2017
    Inventors: Fu-Jen WANG, Ji-Ming CHEN, Ann LEE
  • Publication number: 20160261658
    Abstract: Systems and methods for sharing protected media content are provided. Protected media content can be shared when at least a first and second user are proximately located. The first and second user can be bound or paired based on one or more identification indicia associated with first and second user devices utilized, owned, or operated by the first and second users, respectively. Upon pairing, media content from the first and second users' media content libraries can be shared. Additionally, proximate location can be leveraged to surface media content to other users, giving such other users the opportunity to discover new media content, and otherwise engage in transactions involving the new media content. Further still, the most popular media content associated with the second user or group of users proximate to the first user can be determined and used to prompt further interaction or display information regarding such popular media content.
    Type: Application
    Filed: June 19, 2015
    Publication date: September 8, 2016
    Applicant: Disney Enterprises, Inc.
    Inventors: CHRISTOPHER S. TAYLOR, Mark Arana, Josiah Eatedali, Edward Drake, ANN LEE, Anthony Mutalipassi
  • Publication number: 20150286723
    Abstract: Systems, methods, and computer-readable storage media are provided for identifying dominant entity categories associated with target entities. A target entity is received and plural data sources are utilized to determine entity categories of which the target entity is a member and an initial confidence score for each of the entity categories. Each initial confidence score represents the likelihood that the associated entity category is a dominant category for the target entity. At least one data source includes information pertaining to plural entities arranged in a graph-based ontology that includes identifiers of respective entity categories of which the subject entities are members. Graph-based confidence score propagation is then utilized to incorporate information regarding entities determined to be related to the target entity and accolades associated with the target entity to alter the initial confidence scores provided for various entity categories of which the target entity is a member.
    Type: Application
    Filed: April 7, 2014
    Publication date: October 8, 2015
    Applicant: MICROSOFT CORPORATION
    Inventors: WALTER SUN, HUNG-AN CHANG, JINGFENG LI, ANN LEE
  • Patent number: 7573988
    Abstract: A computer-implemented system is provided, including a network consisting of the Internet, PSTN, and CATV network. The network is connected to multiple users' client systems, and also to multiple service provider systems. Each user has a user profile stored in a User Preference Database, while each service provider has service parameters defining the type of its service stored in a Service Database, both of which are connected to the network. The system further includes a Gatekeeper Server, which establishes voice communication among the client systems and the service provider systems. In operation, upon receiving a user's request for service, the Gatekeeper Server identifies one or more service providers whose service parameters match the user's request for service and the user's profile. Upon the user's selection of one such service provider, the Gatekeeper server automatically selects and establishes a preferred mode of voice connection between the user and the selected service provider.
    Type: Grant
    Filed: June 2, 2004
    Date of Patent: August 11, 2009
    Assignee: DynaLab Inc.
    Inventors: Fisher Chen-yin Lee, Ann Lee, David Liu
  • Publication number: 20080118970
    Abstract: A process for purifying virus particles, especially recombinant adenovirus vector particles, is presented. The process relies on various combinations of cell lysis, detergent-based precipitation of host cell contaminants away from the virus, depth filtration or centrifugation, ultrafiltration, nuclease digestion and chromatography to robustly and economically produce highly purified product. This process results in contaminating DNA levels which are consistently below detectable levels.
    Type: Application
    Filed: December 7, 2007
    Publication date: May 22, 2008
    Inventors: John Konz, Ann Lee, Chi To, Aaron Goerke
  • Publication number: 20060004753
    Abstract: The present invention is directed to a method and computer system for representing a dataset comprising N documents by computing a diffusion geometry of the dataset comprising at least a plurality of diffusion coordinates. The present method and system stores a number of diffusion coordinates, wherein the number is linear in proportion to N.
    Type: Application
    Filed: June 23, 2005
    Publication date: January 5, 2006
    Inventors: Ronald Coifman, Andreas Coppi, Frank Geshwind, Stephane Lafon, Ann Lee, Mauro Maggioni, Frederick Warner, Steven Zucker, William Fateley
  • Publication number: 20050286711
    Abstract: A computer-implemented system is provided, including a network consisting of the Internet, PSTN, and CATV network. The network is connected to multiple users' client systems, and also to multiple service provider systems. Each user has a user profile stored in a User Preference Database, while each service provider has service parameters defining the type of its service stored in a Service Database, both of which are connected to the network. The system further includes a Gatekeeper Server, which establishes voice communication among the client systems and the service provider systems. In operation, upon receiving a user's request for service, the Gatekeeper Server identifies one or more service providers whose service parameters match the user's request for service and the user's profile. Upon the user's selection of one such service provider, the Gatekeeper server automatically selects and establishes a preferred mode of voice connection between the user and the selected service provider.
    Type: Application
    Filed: June 2, 2004
    Publication date: December 29, 2005
    Inventors: Fisher Lee, Ann Lee, David Liu
  • Publication number: 20050196854
    Abstract: A process for purifying virus particles, especially recombinant adenovirus vector particles, is presented. The process relies on various combinations of cell lysis, detergent-based precipitation of host cell contaminants away from the virus, depth filtration or centrifugation, ultrafiltration, nuclease digestion and chromatography to robustly and economically produce highly purified product. This process results in contaminating DNA levels which are consistently below detectable levels.
    Type: Application
    Filed: May 13, 2003
    Publication date: September 8, 2005
    Inventors: John Konz, Ann Lee, Chi To, Aaron Goerke
  • Publication number: 20050153420
    Abstract: A process for purifying virus particles, especially recombinant adenovirus vector particles, is presented. The process relies on various combinations of cell lysis, detergent-based precipitation of host cell contaminants away from the virus, depth filtration or centrifugation, ultrafiltration, nuclease digestion and chromatography to robustly and economically produce highly purified product. This process results in contaminating DNA levels which are consistently below detectable levels.
    Type: Application
    Filed: May 13, 2003
    Publication date: July 14, 2005
    Inventors: John Konz Jr., Ann Lee, Chin To, Aaron Goerke
  • Publication number: 20050108406
    Abstract: The present invention provides a method, system, and software for dynamically generating a customized menu page on a display of a client system coupled to a network, such as the Internet. The menu page includes a number of selectable icons, each associated with a particular Web site, service, Web guide channel, etc., that the user is likely to wish to access. The menu page is “customized” in the sense that each menu page is generated based on each user's network log history and preferences, as stored in a user preference database, so as to present only those information sources/services that he/she would want to access. The menu is “dynamically” generated in the sense that the user preference database is constantly updated so as to present a menu page that reflects the user's most recent preferences and history.
    Type: Application
    Filed: November 7, 2003
    Publication date: May 19, 2005
    Inventors: Fisher Lee, Ann Lee