Patents by Inventor Vitor Rocha de Carvalho
Vitor Rocha de Carvalho has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12164603Abstract: A machine learning based system can identify an entity as the likely subject of a multimodal message (e.g., a social media post having a short text phrase overlaid on an image) by creating embeddings for an image of the multimodal message and one or more string embeddings from text of the multimodal message. The embeddings can be weighted to maximize information gain, then recombined and compared against a result embedding database to identify an entity as the subject of the multimodal message.Type: GrantFiled: September 15, 2022Date of Patent: December 10, 2024Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Patent number: 12155612Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use an attention-based mechanism that emphasis or de-emphasizes each data type (e.g., image, word, character) in the multimodal message based on each datatypes relevance. The output of the attention mechanism can be used to update a recurrent network to identify one or more words in the caption as being a named entity.Type: GrantFiled: July 25, 2023Date of Patent: November 26, 2024Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Publication number: 20240354508Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use a visual attention based mechanism to generate a visual context representation from an image and caption. The system can use the visual context representation to identify one or more terms of the caption as a named entity.Type: ApplicationFiled: July 1, 2024Publication date: October 24, 2024Inventors: Di Lu, Leonardo Ribas Machado das Neves, Vitor Rocha de Carvalho, Ning Zhang
-
Patent number: 12056454Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use a visual attention based mechanism to generate a visual context representation from an image and caption. The system can use the visual context representation to identify one or more terms of the caption as a named entity.Type: GrantFiled: May 23, 2023Date of Patent: August 6, 2024Assignee: Snap Inc.Inventors: Di Lu, Leonardo Ribas Machado das Neves, Vitor Rocha de Carvalho, Ning Zhang
-
Patent number: 12026463Abstract: Systems, devices, media, and methods are presented for generating a language detection model of a language analysis system. The systems and methods access a set of messages including text elements and convert the set of messages into a set of training messages. The set of training messages are configured for training a language detection model. The systems and methods train a classifier based on the set of training messages. The classifier has a set of features representing word frequency, character frequency, and a character ratio. The systems and methods generate a language detection model based on the classifier and the set of features.Type: GrantFiled: May 25, 2023Date of Patent: July 2, 2024Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Luis Carlos Dos Santos Marujo, Leonardo Ribas Machado das Neves
-
Publication number: 20240037141Abstract: Systems, methods, devices, media, and computer readable instructions are described for local image tagging in a resource constrained environment. One embodiment involves processing image data using a deep convolutional neural network (DCNN) comprising at least a first subgraph and a second subgraph, the first subgraph comprising at least a first layer and a second layer, processing, the image data using at least the first layer of the first subgraph to generate first intermediate output data; processing, by the mobile device, the first intermediate output data using at least the second layer of the first subgraph to generate first subgraph output data, and in response to a determination that each layer reliant on the first intermediate data have completed processing, deleting the first intermediate data from the mobile device. Additional embodiments involve convolving entire pixel resolutions of the image data against kernels in different layers if the DCNN.Type: ApplicationFiled: October 10, 2023Publication date: February 1, 2024Inventors: Xiaoyu Wang, Ning Xu, Ning Zhang, Vitor Rocha de Carvalho, Jia Li
-
Publication number: 20240022532Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use an attention-based mechanism that emphasis or de-emphasizes each data type (e.g., image, word, character) in the multimodal message based on each datatypes relevance. The output of the attention mechanism can be used to update a recurrent network to identify one or more words in the caption as being a named entity.Type: ApplicationFiled: July 25, 2023Publication date: January 18, 2024Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Publication number: 20230385551Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use a visual attention based mechanism to generate a visual context representation from an image and caption. The system can use the visual context representation to identify one or more terms of the caption as a named entity.Type: ApplicationFiled: May 23, 2023Publication date: November 30, 2023Inventors: Di Lu, Leonardo Ribas Machado das Neves, Vitor Rocha de Carvalho, Ning Zhang
-
Publication number: 20230297775Abstract: Systems, devices, media, and methods are presented for generating a language detection model of a language analysis system. The systems and methods access a set of messages including text elements and convert the set of messages into a set of training messages. The set of training messages are configured for training a language detection model. The systems and methods train a classifier based on the set of training messages. The classifier has a set of features representing word frequency, character frequency, and a character ratio. The systems and methods generate a language detection model based on the classifier and the set of features.Type: ApplicationFiled: May 25, 2023Publication date: September 21, 2023Inventors: Vitor Rocha de Carvalho, Luis Carlos Dos Santos Marujo, Leonardo Ribas Machado das Neves
-
Patent number: 11750547Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use an attention-based mechanism that emphasis or de-emphasizes each data type (e.g., image, word, character) in the multimodal message based on each datatypes relevance. The output of the attention mechanism can be used to update a recurrent network to identify one or more words in the caption as being a named entity.Type: GrantFiled: August 27, 2021Date of Patent: September 5, 2023Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Patent number: 11704488Abstract: Systems, devices, media, and methods are presented for generating a language detection model of a language analysis system. The systems and methods access a set of messages including text elements and convert the set of messages into a set of training messages. The set of training messages are configured for training a language detection model. The systems and methods train a classifier based on the set of training messages. The classifier has a set of features representing word frequency, character frequency, and a character ratio. The systems and methods generate a language detection model based on the classifier and the set of features.Type: GrantFiled: December 7, 2021Date of Patent: July 18, 2023Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Luis Carlos Dos Santos Marujo, Leonardo Ribas Machado das Neves
-
Patent number: 11687720Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use a visual attention based mechanism to generate a visual context representation from an image and caption. The system can use the visual context representation to identify one or more terms of the caption as a named entity.Type: GrantFiled: May 3, 2021Date of Patent: June 27, 2023Assignee: Snap Inc.Inventors: Di Lu, Leonardo Ribas Machado das Neves, Vitor Rocha de Carvalho, Ning Zhang
-
Publication number: 20230013116Abstract: A machine learning based system can identify an entity as the likely subject of a multimodal message (e.g., a social media post having a short text phrase overlaid on an image) by creating embeddings for an image of the multimodal message and one or more string embeddings from text of the multimodal message. The embeddings can be weighted to maximize information gain, then recombined and compared against a result embedding database to identify an entity as the subject of the multimodal message.Type: ApplicationFiled: September 15, 2022Publication date: January 19, 2023Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Patent number: 11475254Abstract: A machine learning based system can identify an entity as the likely subject of a multimodal message (e.g., a social media post having a short text phrase overlaid on an image) by creating embeddings for an image of the multimodal message and one or more string embeddings from text of the multimodal message. The embeddings can be weighted to maximize information gain, then recombined and compared against a result embedding database to identify an entity as the subject of the multimodal message.Type: GrantFiled: September 7, 2018Date of Patent: October 18, 2022Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Publication number: 20220092261Abstract: Systems, devices, media, and methods are presented for generating a language detection model of a language analysis system. The systems and methods access a set of messages including text elements and convert the set of messages into a set of training messages. The set of training messages are configured for training a language detection model. The systems and methods train a classifier based on the set of training messages. The classifier has a set of features representing word frequency, character frequency, and a character ratio. The systems and methods generate a language detection model based on the classifier and the set of features.Type: ApplicationFiled: December 7, 2021Publication date: March 24, 2022Inventors: Vitor Rocha de Carvalho, Luis Carlos Dos Santos Marujo, Leonardo Ribas Machado das Neves
-
Patent number: 11210467Abstract: Systems, devices, media, and methods are presented for generating a language detection model of a language analysis system. The systems and methods access a set of messages including text elements and convert the set of messages into a set of training messages. The set of training messages are configured for training a language detection model. The systems and methods train a classifier based on the set of training messages. The classifier has a set of features representing word frequency, character frequency, and a character ratio. The systems and methods generate a language detection model based on the classifier and the set of features.Type: GrantFiled: April 13, 2018Date of Patent: December 28, 2021Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Luis Carlos Dos Santos Marujo, Leonardo Ribas Machado das Neves
-
Publication number: 20210390411Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use an attention-based mechanism that emphasis or de-emphasizes each data type (e.g., image, word, character) in the multimodal message based on each datatypes relevance. The output of the attention mechanism can be used to update a recurrent network to identify one or more words in the caption as being a named entity.Type: ApplicationFiled: August 27, 2021Publication date: December 16, 2021Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Patent number: 11120334Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use an attention-based mechanism that emphasis or de-emphasizes each data type (e.g., image, word, character) in the multimodal message based on each datatypes relevance. The output of the attention mechanism can be used to update a recurrent network to identify one or more words in the caption as being a named entity.Type: GrantFiled: September 7, 2018Date of Patent: September 14, 2021Assignee: Snap Inc.Inventors: Vitor Rocha de Carvalho, Leonardo Ribas Machado das Neves, Seungwhan Moon
-
Publication number: 20210256213Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use a visual attention based mechanism to generate a visual context representation from an image and caption. The system can use the visual context representation to identify one or more terms of the caption as a named entity.Type: ApplicationFiled: May 3, 2021Publication date: August 19, 2021Inventors: Di Lu, Leonardo Ribas Machado das Neves, Vitor Rocha de Carvalho, Ning Zhang
-
Patent number: 11017173Abstract: A caption of a multimodal message (e.g., social media post) can be identified as a named entity using an entity recognition system. The entity recognition system can use a visual attention based mechanism to generate a visual context representation from an image and caption. The system can use the visual context representation to identify one or more terms of the caption as a named entity.Type: GrantFiled: December 21, 2018Date of Patent: May 25, 2021Assignee: Snap Inc.Inventors: Di Lu, Leonardo Ribas Machado das Neves, Vitor Rocha de Carvalho, Ning Zhang