Patents by Inventor William Chan
William Chan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12387096Abstract: A method includes receiving training data comprising a plurality of pairs of images. Each pair comprises a noisy image and a denoised version of the noisy image. The method also includes training a multi-task diffusion model to perform a plurality of image-to-image translation tasks, wherein the training comprises iteratively generating a forward diffusion process by predicting, at each iteration in a sequence of iterations and based on a current noisy estimate of the denoised version of the noisy image, noise data for a next noisy estimate of the denoised version of the noisy image, updating, at each iteration, the current noisy estimate to the next noisy estimate by combining the current noisy estimate with the predicted noise data, and determining a reverse diffusion process by inverting the forward diffusion process to predict the denoised version of the noisy image. The method additionally includes providing the trained diffusion model.Type: GrantFiled: October 5, 2022Date of Patent: August 12, 2025Assignee: Google LLCInventors: Chitwan Saharia, Mohammad Norouzi, William Chan, Huiwen Chang, David James Fleet, Christopher Albert Lee, Jonathan Ho, Tim Salimans
-
Publication number: 20250209806Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.Type: ApplicationFiled: March 13, 2025Publication date: June 26, 2025Inventors: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans
-
Patent number: 12277758Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.Type: GrantFiled: December 29, 2023Date of Patent: April 15, 2025Assignee: Google LLCInventors: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans
-
Patent number: 12242818Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sequence modeling. One of the methods includes receiving an input sequence having a plurality of input positions; determining a plurality of blocks of consecutive input positions; processing the input sequence using a neural network to generate a latent alignment, comprising, at each of a plurality of input time steps: receiving a partial latent alignment from a previous input time step; selecting an input position in each block, wherein the token at the selected input position of the partial latent alignment in each block is a mask token; and processing the partial latent alignment and the input sequence using the neural network to generate a new latent alignment, wherein the new latent alignment comprises, at the selected input position in each block, an output token or a blank token; and generating, using the latent alignment, an output sequence.Type: GrantFiled: February 8, 2021Date of Patent: March 4, 2025Assignee: Google LLCInventors: William Chan, Chitwan Saharia, Geoffrey E. Hinton, Mohammad Norouzi, Navdeep Jaitly
-
Publication number: 20250061551Abstract: A method includes receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image. The method also includes training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a forward Gaussian diffusion process that adds Gaussian noise to the at least one corresponding target version of each of the plurality of pairs of images to enable iterative denoising of the input image, wherein the iterative denoising is based on a reverse Markov chain associated with the forward Gaussian diffusion process. The method additionally includes outputting the trained neural network.Type: ApplicationFiled: November 7, 2024Publication date: February 20, 2025Inventors: Chitwan Saharia, Jonathan Ho, William Chan, Tim Salimans, David Fleet, Mohammad Norouzi
-
Publication number: 20240428194Abstract: A clinical documentation improvement (CDI) smart scoring method may include predicting, via per-condition diagnosis machine learning (ML) models and based on clinical evidence received by a system, a probability that a medical condition is under-documented and, via per-condition documentation ML models and based on documentation received by the system, a probability that a medical condition is over-documented. The under- and over-documentation scores are combined in view of special indicators and queryability factors, which can also be evaluated using ML query prediction models, to generate an initial CDI score. This CDI score can be further adjusted, if necessary or desired, to account for factors such as length of stay, payer, patient location, CDI review timing, etc. The final CDI score can be used to prioritize patient cases for review by CDI specialists to quickly and efficiently identify meaningful CDI opportunities.Type: ApplicationFiled: September 5, 2024Publication date: December 26, 2024Inventors: William Chan, W. Lance Eason, Timothy Harper, Bryan Home, Michael Kadyan, Jonathan Matthews, Joshua Toub
-
Patent number: 12165289Abstract: A method includes receiving, by a computing device, training data comprising a plurality of pairs of images, wherein each pair comprises an image and at least one corresponding target version of the image. The method also includes training a neural network based on the training data to predict an enhanced version of an input image, wherein the training of the neural network comprises applying a forward Gaussian diffusion process that adds Gaussian noise to the at least one corresponding target version of each of the plurality of pairs of images to enable iterative denoising of the input image, wherein the iterative denoising is based on a reverse Markov chain associated with the forward Gaussian diffusion process. The method additionally includes outputting the trained neural network.Type: GrantFiled: July 27, 2023Date of Patent: December 10, 2024Assignee: Google LLCInventors: Chitwan Saharia, Jonathan Ho, William Chan, Tim Salimans, David Fleet, Mohammad Norouzi
-
Publication number: 20240338936Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output video conditioned on an input. In one aspect, a method comprises receiving the input; initializing a current intermediate representation; generating an output video by updating the current intermediate representation at each of a plurality of iterations, wherein the updating comprises, at each iteration: processing an intermediate input for the iteration comprising the current intermediate representation using a diffusion model that is configured to process the intermediate input to generate a noise output; and updating the current intermediate representation using the noise output for the iteration.Type: ApplicationFiled: April 6, 2023Publication date: October 10, 2024Inventors: Jonathan Ho, Tim Salimans, Alexey Alexeevich Gritsenko, William Chan, Mohammad Norouzi, David James Fleet
-
Patent number: 12112296Abstract: A clinical documentation improvement (CDI) smart scoring method may include predicting, via per-condition diagnosis machine learning (ML) models and based on clinical evidence received by a system, a probability that a medical condition is under-documented and, via per-condition documentation ML models and based on documentation received by the system, a probability that a medical condition is over-documented. The under- and over-documentation scores are combined in view of special indicators and queryability factors, which can also be evaluated using ML query prediction models, to generate an initial CDI score. This CDI score can be further adjusted, if necessary or desired, to account for factors such as length of stay, payer, patient location, CDI review timing, etc. The final CDI score can be used to prioritize patient cases for review by CDI specialists to quickly and efficiently identify meaningful CDI opportunities.Type: GrantFiled: August 16, 2023Date of Patent: October 8, 2024Assignee: IODINE SOFTWARE, LLCInventors: William Chan, W. Lance Eason, Timothy Harper, Bryan Horne, Michael Kadyan, Jonathan Matthews, Joshua Toub
-
Patent number: 12106064Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating network outputs using insertion operations.Type: GrantFiled: December 15, 2022Date of Patent: October 1, 2024Assignee: Google LLCInventors: Jakob D. Uszkoreit, Mitchell Thomas Stern, Jamie Ryan Kiros, William Chan
-
Publication number: 20240320965Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.Type: ApplicationFiled: December 29, 2023Publication date: September 26, 2024Inventors: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans
-
Patent number: 12100391Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for speech recognition. One method includes obtaining an input acoustic sequence, the input acoustic sequence representing an utterance, and the input acoustic sequence comprising a respective acoustic feature representation at each of a first number of time steps; processing the input acoustic sequence using a first neural network to convert the input acoustic sequence into an alternative representation for the input acoustic sequence; processing the alternative representation for the input acoustic sequence using an attention-based Recurrent Neural Network (RNN) to generate, for each position in an output sequence order, a set of substring scores that includes a respective substring score for each substring in a set of substrings; and generating a sequence of substrings that represent a transcription of the utterance.Type: GrantFiled: October 7, 2021Date of Patent: September 24, 2024Assignee: Google LLCInventors: William Chan, Navdeep Jaitly, Quoc V. Le, Oriol Vinyals, Noam M. Shazeer
-
Publication number: 20240303673Abstract: Fraud detection model performance is represented as compressed data in the form of polynomial curve coefficients. A data compression setting, a set of independent variable values, and a set of dependent variable values are used in polynomial regression to generate coefficients of a polynomial curve. The data compression setting is related to the order of the polynomial, for example set to the degrees of freedom (DOF) defined as the polynomial order. A lower DOF yields a higher error, but with a higher degree of compression. The lowest DOF with a tolerable error is selected and the polynomial coefficients are transmitted to a remote node. The remote node regenerates the polynomial curve for comparison with a polynomial curve from a prior time period, in order to determine a performance trend. The trend is used to either generate an alert or trigger further training of the fraud detection model.Type: ApplicationFiled: March 9, 2023Publication date: September 12, 2024Inventor: William CHAN
-
Patent number: 12086715Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.Type: GrantFiled: May 22, 2023Date of Patent: September 10, 2024Assignee: Google LLCInventors: William Chan, Mitchell Thomas Stern, Nikita Kitaev, Kelvin Gu, Jakob D. Uszkoreit
-
Publication number: 20240249456Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.Type: ApplicationFiled: April 2, 2024Publication date: July 25, 2024Inventors: Chitwan Saharia, William Chan, Mohammad Norouzi, Saurabh Saxena, Yi Li, Jay Ha Whang, David James Fleet, Jonathan Ho
-
Publication number: 20240249084Abstract: Disclosed are systems and methods for a novel machine translation as a service (MTaS) framework that enables real-time machine translation to realize improved accuracy without degrading the speed in which a translator can operate. The disclosed translator technology can provide an improved, computationally efficient and accurate system that can improve how translations are provided, which can improve how translation-based processes are performed. The disclosed framework enables translation string gaps between translated releases to be identified, filled and/or corrected. Such automated and/or selected translation correction, modification and/or fine-tuning can be enabled via any type of translation model and/or model combination.Type: ApplicationFiled: January 25, 2024Publication date: July 25, 2024Inventors: Aarthy Thasma Ravindran, William Chan, Jian He, Forrest Scott
-
Patent number: 11978141Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating images. In one aspect, a method includes: receiving an input text prompt including a sequence of text tokens in a natural language; processing the input text prompt using a text encoder neural network to generate a set of contextual embeddings of the input text prompt; and processing the contextual embeddings through a sequence of generative neural networks to generate a final output image that depicts a scene that is described by the input text prompt.Type: GrantFiled: May 19, 2023Date of Patent: May 7, 2024Assignee: Google LLCInventors: Chitwan Saharia, William Chan, Mohammad Norouzi, Saurabh Saxena, Yi Li, Jay Ha Whang, David James Fleet, Jonathan Ho
-
Patent number: 11955213Abstract: In clinical documentation, mere documentation of a condition in a patient's records may not be enough. To be considered sufficiently documented, the patient's record needs to show that no documentation drop-offs (DDOs) have occurred over the course of the patient's stay. However, DDOs can be extremely difficult to detect. To solve this problem, the invention trains time-sensitive deep learning (DL) models on a per condition basis using actual and/or synthetic patient data. Utilizing an ontology, grouped concepts can be generated on the fly from real-time hospital data and used to generate time-series data that can then be analyzed by trained time-sensitive DL models to determine whether a DDO for a condition has occurred during the stay. Non-time-sensitive models can be used to detect all the conditions documented during the stay. Outcomes from the models can be compared to determine whether to notify a user that a DDO has occurred.Type: GrantFiled: February 13, 2023Date of Patent: April 9, 2024Assignee: IODINE SOFTWARE, LLCInventors: Jonathan Matthews, W. Lance Eason, William Chan, Michael Kadyan, Frances Elizabeth Jurcak, Timothy Paul Harper
-
Patent number: 11908180Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium. In one aspect, a method includes receiving a text prompt describing a scene; processing the text prompt using a text encoder neural network to generate a contextual embedding of the text prompt; and processing the contextual embedding using a sequence of generative neural networks to generate a final video depicting the scene.Type: GrantFiled: March 24, 2023Date of Patent: February 20, 2024Assignee: Google LLCInventors: Jonathan Ho, William Chan, Chitwan Saharia, Jay Ha Whang, Tim Salimans
-
Publication number: 20240028893Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing sequence modeling tasks using insertions. One of the methods includes receiving a system input that includes one or more source elements from a source sequence and zero or more target elements from a target sequence, wherein each source element is selected from a vocabulary of source elements and wherein each target element is selected from a vocabulary of target elements; generating a partial concatenated sequence that includes the one or more source elements from the source sequence and the zero or more target elements from the target sequence, wherein the source and target elements arranged in the partial concatenated sequence according to a combined order; and generating a final concatenated sequence that includes a finalized source sequence and a finalized target sequence, wherein the finalized target sequence includes one or more target elements.Type: ApplicationFiled: May 22, 2023Publication date: January 25, 2024Inventors: William Chan, Mitchell Thomas Stern, Nikita Kitaev, Kelvin Gu, Jakob D. Uszkoreit