Patents by Inventor Ermo Wei
Ermo Wei has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12373391Abstract: The present disclosure is directed toward systems, methods, and non-transitory computer-readable media for generating a dynamic facet by using a large language model. For example, the disclosed systems extract raw facet data from a plurality of content items stored in a content management system. In addition, the disclosed systems determine one or more facet content groups by grouping the plurality of content items according to the raw facet data. Further, the disclosed systems generate a facet prompt from the one or more facet content groups. Moreover, the disclosed systems generate a dynamic facet by providing the facet prompt to a large language model.Type: GrantFiled: July 29, 2024Date of Patent: July 29, 2025Assignee: Dropbox, Inc.Inventors: Wesley Liao, Ermo Wei, Aleksander Dash, Morgan Zerby, Simon Shubbar, Prasang Upadhyaya, Maor Bar Asher, Jiarui Ding
-
Publication number: 20240320288Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine learning models to generate identifier embeddings from digital content identifiers and then leverage these identifier embeddings to determine digital connections between digital content items. In particular, the disclosed systems can utilize an embedding machine-learning model that comprises a character-level embedding machine-learning model and a word-level embedding machine-learning model. For example, the disclosed systems can combine a character embedding from the character-level embedding machine-learning model and a token embedding from the word-level embedding machine-learning model. The disclosed systems can determine digital connections between the plurality of digital content items by processing these identifier embeddings for a plurality of digital content items utilizing a content management model.Type: ApplicationFiled: May 31, 2024Publication date: September 26, 2024Inventors: Jongmin Baek, Jiarui Ding, Ermo Wei, Scott McCrae
-
Patent number: 12008065Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine learning models to generate identifier embeddings from digital content identifiers and then leverage these identifier embeddings to determine digital connections between digital content items. In particular, the disclosed systems can utilize an embedding machine-learning model that comprises a character-level embedding machine-learning model and a word-level embedding machine-learning model. For example, the disclosed systems can combine a character embedding from the character-level embedding machine-learning model and a token embedding from the word-level embedding machine-learning model. The disclosed systems can determine digital connections between the plurality of digital content items by processing these identifier embeddings for a plurality of digital content items utilizing a content management model.Type: GrantFiled: January 12, 2023Date of Patent: June 11, 2024Assignee: Dropbox, Inc.Inventors: Jongmin Baek, Jiarui Ding, Ermo Wei, Scott McCrae
-
Publication number: 20240054146Abstract: One or more embodiments of a synchronization system facilitate selectivity synchronizing digital content items from a collection of digital content items to a local storage of a client device. In particular, one or more embodiments described herein collect and analyze recall data for users of a digital content management system with respect to digital content items to determine synchronization scores for the digital content items. One or more embodiments described herein further include selectively identifying a subset of the digital content items based on the synchronization scores to recommend for synchronization to a local storage of a client device.Type: ApplicationFiled: October 26, 2023Publication date: February 15, 2024Inventors: Ermo Wei, Jialiang Li, Kaiyue Sun, Li Chen Koh, Mingye Xia, Yu Zhang, Yuyang Guo
-
Patent number: 11853817Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that can leverage a natural language model to determine a most probable candidate sequence of tokens and thereby generate a predicted user activity. In particular, the disclosed systems can tokenize activity event vectors to generate a series of sequential tokens that correspond to recent user activity of one or more user accounts. In addition, the disclosed systems can, for each candidate (e.g., hypothetical) user activity, augment the series of sequential tokens to include a corresponding token. Based on respective probability scores for each of the augmented series of sequential tokens, the disclosed systems can identify as the predicted user activity, a candidate user activity corresponding to one of the augmented series of sequential tokens associated with a highest probability score. Based on the predicted user activity, the disclosed systems can surface one or more suggestions to a client device.Type: GrantFiled: January 18, 2023Date of Patent: December 26, 2023Assignee: Dropbox, Inc.Inventors: Ranjitha Gurunath Kulkarni, Xingyu Xiang, Jongmin Baek, Ermo Wei
-
Patent number: 11809450Abstract: One or more embodiments of a synchronization system facilitate selectivity synchronizing digital content items from a collection of digital content items to a local storage of a client device. In particular, one or more embodiments described herein collect and analyze recall data for users of a digital content management system with respect to digital content items to determine synchronization scores for the digital content items. One or more embodiments described herein further include selectively identifying a subset of the digital content items based on the synchronization scores to recommend for synchronization to a local storage of a client device.Type: GrantFiled: May 16, 2022Date of Patent: November 7, 2023Assignee: Dropbox, Inc.Inventors: Ermo Wei, Jialiang Li, Kaiyue Sun, Li Chen Koh, Mingye Xia, Yu Zhang, Yuyang Guo
-
Publication number: 20230185768Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine-learning models to classify content items and automatically organize the content items within a file structure according to their content item classifications. For instance, a content item classification system generates one or more content item classification models to determine classifications for content items and/or folders. In some instances, the classification system detects when new content items are added to a smart folder, determines destination folders to which the content items belong based on classifying the content items, and automatically moves the content items accordingly. In various instances, the classification system generates and utilizes a classification model to organize content items into dynamically-generated folders.Type: ApplicationFiled: December 11, 2021Publication date: June 15, 2023Inventors: Tristan Inghelbrecht, Jongmin Baek, Ermo Wei, Morgan Zerby, Win Suen, Shubham Goel
-
Publication number: 20230186071Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine-learning models to classify content items and automatically organize the content items within a file structure according to their content item classifications. For instance, a content item classification system generates one or more content item classification models to determine classifications for content items and/or folders. In some instances, the classification system detects when new content items are added to a smart folder, determines destination folders to which the content items belong based on classifying the content items, and automatically moves the content items accordingly. In various instances, the classification system generates and utilizes a classification model to organize content items into dynamically-generated folders.Type: ApplicationFiled: December 11, 2021Publication date: June 15, 2023Inventors: Tristan Frederick Rice, Jongmin Baek, Ermo Wei, Morgan Zerby, Win Suen, David Lichtenberg, Thomas Berg, Christopher Lesniewski-Laas, Brandon Obas, Mingming Liu, Zachary Smetana, Bryan Guillemette, Panashe Machinda Fundira, Kevin Li, Vidit Bhargava
-
Publication number: 20230169139Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine learning models to generate identifier embeddings from digital content identifiers and then leverage these identifier embeddings to determine digital connections between digital content items. In particular, the disclosed systems can utilize an embedding machine-learning model that comprises a character-level embedding machine-learning model and a word-level embedding machine-learning model. For example, the disclosed systems can combine a character embedding from the character-level embedding machine-learning model and a token embedding from the word-level embedding machine-learning model. The disclosed systems can determine digital connections between the plurality of digital content items by processing these identifier embeddings for a plurality of digital content items utilizing a content management model.Type: ApplicationFiled: January 12, 2023Publication date: June 1, 2023Inventors: Jongmin Baek, Jiarui Ding, Ermo Wei, Scott McCrae
-
Patent number: 11568018Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine learning models to generate identifier embeddings from digital content identifiers and then leverage these identifier embeddings to determine digital connections between digital content items. In particular, the disclosed systems can utilize an embedding machine-learning model that comprises a character-level embedding machine-learning model and a word-level embedding machine-learning model. For example, the disclosed systems can combine a character embedding from the character-level embedding machine-learning model and a token embedding from the word-level embedding machine-learning model. The disclosed systems can determine digital connections between the plurality of digital content items by processing these identifier embeddings for a plurality of digital content items utilizing a content management model.Type: GrantFiled: December 22, 2020Date of Patent: January 31, 2023Assignee: Dropbox, Inc.Inventors: Jongmin Baek, Jiarui Ding, Ermo Wei, Scott McCrae
-
Patent number: 11567812Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that can leverage a natural language model to determine a most probable candidate sequence of tokens and thereby generate a predicted user activity. In particular, the disclosed systems can tokenize activity event vectors to generate a series of sequential tokens that correspond to recent user activity of one or more user accounts. In addition, the disclosed systems can, for each candidate (e.g., hypothetical) user activity, augment the series of sequential tokens to include a corresponding token. Based on respective probability scores for each of the augmented series of sequential tokens, the disclosed systems can identify as the predicted user activity, a candidate user activity corresponding to one of the augmented series of sequential tokens associated with a highest probability score. Based on the predicted user activity, the disclosed systems can surface one or more suggestions to a client device.Type: GrantFiled: October 7, 2020Date of Patent: January 31, 2023Assignee: Dropbox, Inc.Inventors: Ranjitha Gurunath Kulkarni, Xingyu Xiang, Jongmin Baek, Ermo Wei
-
Publication number: 20220277020Abstract: One or more embodiments of a synchronization system facilitate selectivity synchronizing digital content items from a collection of digital content items to a local storage of a client device. In particular, one or more embodiments described herein collect and analyze recall data for users of a digital content management system with respect to digital content items to determine synchronization scores for the digital content items. One or more embodiments described herein further include selectively identifying a subset of the digital content items based on the synchronization scores to recommend for synchronization to a local storage of a client device.Type: ApplicationFiled: May 16, 2022Publication date: September 1, 2022Inventors: Ermo Wei, Jialiang Li, Kaiyue Sun, Li Chen Koh, Mingye Xia, Yu Zhang, Yuyang Guo
-
Publication number: 20220197961Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine learning models to generate identifier embeddings from digital content identifiers and then leverage these identifier embeddings to determine digital connections between digital content items. In particular, the disclosed systems can utilize an embedding machine-learning model that comprises a character-level embedding machine-learning model and a word-level embedding machine-learning model. For example, the disclosed systems can combine a character embedding from the character-level embedding machine-learning model and a token embedding from the word-level embedding machine-learning model. The disclosed systems can determine digital connections between the plurality of digital content items by processing these identifier embeddings for a plurality of digital content items utilizing a content management model.Type: ApplicationFiled: December 22, 2020Publication date: June 23, 2022Inventors: Jongmin Baek, Jiarui Ding, Ermo Wei, Scott McCrae
-
Patent number: 11334596Abstract: One or more embodiments of a synchronization system facilitate selectivity synchronizing digital content items from a collection of digital content items to a local storage of a client device. In particular, one or more embodiments described herein collect and analyze recall data for users of a digital content management system with respect to digital content items to determine synchronization scores for the digital content items. One or more embodiments described herein further include selectively identifying a subset of the digital content items based on the synchronization scores to recommend for synchronization to a local storage of a client device.Type: GrantFiled: April 27, 2018Date of Patent: May 17, 2022Assignee: Dropbox, Inc.Inventors: Ermo Wei, Jialiang Li, Kaiyue Sun, Li Chen Koh, Mingye Xia, Yu Zhang, Yuyang Guo
-
Publication number: 20220107852Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that can leverage a natural language model to determine a most probable candidate sequence of tokens and thereby generate a predicted user activity. In particular, the disclosed systems can tokenize activity event vectors to generate a series of sequential tokens that correspond to recent user activity of one or more user accounts. In addition, the disclosed systems can, for each candidate (e.g., hypothetical) user activity, augment the series of sequential tokens to include a corresponding token. Based on respective probability scores for each of the augmented series of sequential tokens, the disclosed systems can identify as the predicted user activity, a candidate user activity corresponding to one of the augmented series of sequential tokens associated with a highest probability score. Based on the predicted user activity, the disclosed systems can surface one or more suggestions to a client device.Type: ApplicationFiled: October 7, 2020Publication date: April 7, 2022Inventors: Ranjitha Gurunath Kulkarni, Xingyu Xiang, Jongmin Baek, Ermo Wei
-
Publication number: 20190332710Abstract: One or more embodiments of a synchronization system facilitate selectivity synchronizing digital content items from a collection of digital content items to a local storage of a client device. In particular, one or more embodiments described herein collect and analyze recall data for users of a digital content management system with respect to digital content items to determine synchronization scores for the digital content items. One or more embodiments described herein further include selectively identifying a subset of the digital content items based on the synchronization scores to recommend for synchronization to a local storage of a client device.Type: ApplicationFiled: April 27, 2018Publication date: October 31, 2019Inventors: Ermo Wei, Jialiang Li, Kaiyue Sun, Li Chen Koh, Mingye Xia, Yu Zhang, Yuyang Guo