Patents by Inventor Ann Lee
Ann Lee has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20260100204Abstract: A method to generate synchronized audio for a video includes receiving the video including a sequence of frames and receiving a text input describing at least one of a scene, an event, or a mood to be reflected in an audio track. The method also includes generating a latent audio representation via an audio generation model conditioned jointly on video embeddings associated with the sequence of frames and text embeddings associated with the text input. The method also includes decoding the latent audio representation to produce an audio track temporally aligned with the video and semantically consistent with the text input.Type: ApplicationFiled: October 2, 2025Publication date: April 9, 2026Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
-
Publication number: 20260100203Abstract: A method to edit a video includes receiving an input video including a sequence of frames and receiving an editing instruction expressed in natural language. The method also includes generating a multimodal condition based on the textual editing instruction and the input video. The multimodal condition may include an embedding of the input video concatenated with an embedding of the textual editing instruction. The method also includes applying, via a video editing model, the multimodal condition to modify visual content of the input video. The method further includes generating an edited video including visual modifications corresponding to the textual editing instruction. The edited video preserves temporal coherence and overall visual fidelity of the input video.Type: ApplicationFiled: October 2, 2025Publication date: April 9, 2026Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
-
Publication number: 20260101081Abstract: A system and method to generate a video is provided. The method may include generating, based on a user input including a description of a desired video, a structured script including one or more of scene descriptions, dialogue, or explicit shot-level information. The method also includes generating, based on the structured script, a sequence of video frames representing one or more scenes. The method further includes generating, based on the structured script and the sequence of video frames, an audio track including one or more of ambient sounds, sound effects, or music. The generated audio track being temporally synchronized with the sequence of video frames. The method also includes combining the sequence of video frames with the audio track to generate a synchronized video output representing the desired video.Type: ApplicationFiled: October 2, 2025Publication date: April 9, 2026Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
-
Publication number: 20260099978Abstract: A method to generate a video includes receiving an input describing a scene. The method also includes receiving a reference image depicting a character. The method further includes generating, via an encoder, embeddings of identity features of the reference image. The method also includes generating, via a video generation model, the video in which the character appears with consistent likeness across multiple frames in accordance with the embeddings and the text prompt.Type: ApplicationFiled: October 2, 2025Publication date: April 9, 2026Inventors: Zecheng He, Samaneh Azadi, Bowen Shi, Apoorv Vyas, Ann Lee, Ishan Satish Misra, Peizhao Zhang, Roshan Rajesh Sumbaly, Yaniv Nechemia Taigman, Peter Vajda, Yi-Chiao Wu, Andros Tjandra, Wei-Ning Hsu, Amit Zohar, Animesh Sinha, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Matthew Le, Juefei Xu, Haoyu Ma, Tingbo Hou
-
Publication number: 20230186035Abstract: In one embodiment, a method includes accessing a first utterance of a content by a first speaker, generating first discrete speech units from the first utterance based on a speech-learning model, wherein each of the first discrete speech units is associated with a speech cluster, accessing second utterances of the content by second speakers different from the first speaker, and training a speech normalizer by processing each of the second utterances using the speech normalizer to generate second discrete speech units and updating the speech normalizer by using the first discrete speech units as an optimization target for the second discrete speech units associated with each of the second utterances.Type: ApplicationFiled: August 16, 2022Publication date: June 15, 2023Inventors: Ann Lee, Peng-Jen Chen, Holger Schwenk, Jiatao Gu, Wei-Ning Hsu
-
Patent number: 10701120Abstract: Systems and methods for sharing protected media content are provided. Protected media content can be shared when at least a first and second user are proximately located. The first and second user can be bound or paired based on one or more identification indicia associated with first and second user devices utilized, owned, or operated by the first and second users, respectively. Upon pairing, media content from the first and second users' media content libraries can be shared. Additionally, proximate location can be leveraged to surface media content to other users, giving such other users the opportunity to discover new media content, and otherwise engage in transactions involving the new media content. Further still, the most popular media content associated with the second user or group of users proximate to the first user can be determined and used to prompt further interaction or display information regarding such popular media content.Type: GrantFiled: June 19, 2015Date of Patent: June 30, 2020Assignee: Disney Enterprises, Inc.Inventors: Christopher S. Taylor, Mark Arana, Josiah Eatedali, Edward Drake, Ann Lee, Anthony Mutalipassi
-
Patent number: 10346034Abstract: A method for dynamically generating a personalized handwriting character font includes inputting a plurality of handwriting sequentially through an input interface. Each handwriting describes a character. Then, the positions of strokes of characters in the input interface described by the plurality of handwriting are identified. Next, font characteristics of the characters are determined according to the positions of strokes in the input interface. A personalized handwriting character font characteristic is determined according to the font characteristics. Finally, a new character font file with a personalized handwriting character font is generated according to the personalized handwriting character font characteristic.Type: GrantFiled: September 13, 2016Date of Patent: July 9, 2019Assignee: DynaComware Taiwan Inc.Inventors: Fu-Jen Wang, Ji-Ming Chen, Ann Lee
-
Publication number: 20170109034Abstract: A method for dynamically generating a personalized handwriting character font includes inputting a plurality of handwriting sequentially through an input interface. Each handwriting describes a character. Then, the positions of strokes of characters in the input interface described by the plurality of handwriting are identified. Next, font characteristics of the characters are determined according to the positions of strokes in the input interface. A personalized handwriting character font characteristic is determined according to the font characteristics. Finally, a new character font file with a personalized handwriting character font is generated according to the personalized handwriting character font characteristic.Type: ApplicationFiled: September 13, 2016Publication date: April 20, 2017Inventors: Fu-Jen WANG, Ji-Ming CHEN, Ann LEE
-
Publication number: 20160261658Abstract: Systems and methods for sharing protected media content are provided. Protected media content can be shared when at least a first and second user are proximately located. The first and second user can be bound or paired based on one or more identification indicia associated with first and second user devices utilized, owned, or operated by the first and second users, respectively. Upon pairing, media content from the first and second users' media content libraries can be shared. Additionally, proximate location can be leveraged to surface media content to other users, giving such other users the opportunity to discover new media content, and otherwise engage in transactions involving the new media content. Further still, the most popular media content associated with the second user or group of users proximate to the first user can be determined and used to prompt further interaction or display information regarding such popular media content.Type: ApplicationFiled: June 19, 2015Publication date: September 8, 2016Applicant: Disney Enterprises, Inc.Inventors: CHRISTOPHER S. TAYLOR, Mark Arana, Josiah Eatedali, Edward Drake, ANN LEE, Anthony Mutalipassi
-
Publication number: 20150286723Abstract: Systems, methods, and computer-readable storage media are provided for identifying dominant entity categories associated with target entities. A target entity is received and plural data sources are utilized to determine entity categories of which the target entity is a member and an initial confidence score for each of the entity categories. Each initial confidence score represents the likelihood that the associated entity category is a dominant category for the target entity. At least one data source includes information pertaining to plural entities arranged in a graph-based ontology that includes identifiers of respective entity categories of which the subject entities are members. Graph-based confidence score propagation is then utilized to incorporate information regarding entities determined to be related to the target entity and accolades associated with the target entity to alter the initial confidence scores provided for various entity categories of which the target entity is a member.Type: ApplicationFiled: April 7, 2014Publication date: October 8, 2015Applicant: MICROSOFT CORPORATIONInventors: WALTER SUN, HUNG-AN CHANG, JINGFENG LI, ANN LEE
-
Patent number: 7573988Abstract: A computer-implemented system is provided, including a network consisting of the Internet, PSTN, and CATV network. The network is connected to multiple users' client systems, and also to multiple service provider systems. Each user has a user profile stored in a User Preference Database, while each service provider has service parameters defining the type of its service stored in a Service Database, both of which are connected to the network. The system further includes a Gatekeeper Server, which establishes voice communication among the client systems and the service provider systems. In operation, upon receiving a user's request for service, the Gatekeeper Server identifies one or more service providers whose service parameters match the user's request for service and the user's profile. Upon the user's selection of one such service provider, the Gatekeeper server automatically selects and establishes a preferred mode of voice connection between the user and the selected service provider.Type: GrantFiled: June 2, 2004Date of Patent: August 11, 2009Assignee: DynaLab Inc.Inventors: Fisher Chen-yin Lee, Ann Lee, David Liu
-
Publication number: 20080118970Abstract: A process for purifying virus particles, especially recombinant adenovirus vector particles, is presented. The process relies on various combinations of cell lysis, detergent-based precipitation of host cell contaminants away from the virus, depth filtration or centrifugation, ultrafiltration, nuclease digestion and chromatography to robustly and economically produce highly purified product. This process results in contaminating DNA levels which are consistently below detectable levels.Type: ApplicationFiled: December 7, 2007Publication date: May 22, 2008Inventors: John Konz, Ann Lee, Chi To, Aaron Goerke
-
Publication number: 20060004753Abstract: The present invention is directed to a method and computer system for representing a dataset comprising N documents by computing a diffusion geometry of the dataset comprising at least a plurality of diffusion coordinates. The present method and system stores a number of diffusion coordinates, wherein the number is linear in proportion to N.Type: ApplicationFiled: June 23, 2005Publication date: January 5, 2006Inventors: Ronald Coifman, Andreas Coppi, Frank Geshwind, Stephane Lafon, Ann Lee, Mauro Maggioni, Frederick Warner, Steven Zucker, William Fateley
-
Publication number: 20050286711Abstract: A computer-implemented system is provided, including a network consisting of the Internet, PSTN, and CATV network. The network is connected to multiple users' client systems, and also to multiple service provider systems. Each user has a user profile stored in a User Preference Database, while each service provider has service parameters defining the type of its service stored in a Service Database, both of which are connected to the network. The system further includes a Gatekeeper Server, which establishes voice communication among the client systems and the service provider systems. In operation, upon receiving a user's request for service, the Gatekeeper Server identifies one or more service providers whose service parameters match the user's request for service and the user's profile. Upon the user's selection of one such service provider, the Gatekeeper server automatically selects and establishes a preferred mode of voice connection between the user and the selected service provider.Type: ApplicationFiled: June 2, 2004Publication date: December 29, 2005Inventors: Fisher Lee, Ann Lee, David Liu
-
Publication number: 20050196854Abstract: A process for purifying virus particles, especially recombinant adenovirus vector particles, is presented. The process relies on various combinations of cell lysis, detergent-based precipitation of host cell contaminants away from the virus, depth filtration or centrifugation, ultrafiltration, nuclease digestion and chromatography to robustly and economically produce highly purified product. This process results in contaminating DNA levels which are consistently below detectable levels.Type: ApplicationFiled: May 13, 2003Publication date: September 8, 2005Inventors: John Konz, Ann Lee, Chi To, Aaron Goerke
-
Publication number: 20050153420Abstract: A process for purifying virus particles, especially recombinant adenovirus vector particles, is presented. The process relies on various combinations of cell lysis, detergent-based precipitation of host cell contaminants away from the virus, depth filtration or centrifugation, ultrafiltration, nuclease digestion and chromatography to robustly and economically produce highly purified product. This process results in contaminating DNA levels which are consistently below detectable levels.Type: ApplicationFiled: May 13, 2003Publication date: July 14, 2005Inventors: John Konz Jr., Ann Lee, Chin To, Aaron Goerke
-
Publication number: 20050108406Abstract: The present invention provides a method, system, and software for dynamically generating a customized menu page on a display of a client system coupled to a network, such as the Internet. The menu page includes a number of selectable icons, each associated with a particular Web site, service, Web guide channel, etc., that the user is likely to wish to access. The menu page is “customized” in the sense that each menu page is generated based on each user's network log history and preferences, as stored in a user preference database, so as to present only those information sources/services that he/she would want to access. The menu is “dynamically” generated in the sense that the user preference database is constantly updated so as to present a menu page that reflects the user's most recent preferences and history.Type: ApplicationFiled: November 7, 2003Publication date: May 19, 2005Inventors: Fisher Lee, Ann Lee