Patents Assigned to salesforce.com, inc.
-
Patent number: 11829442Abstract: Some embodiments of the current disclosure disclose methods and systems for batch active learning using the Shapley values of data points. In some embodiments, Shapley values of a first subset of labeled data are used to measure the contributions of the first subset of data to the performance of neural network. Further, a regression model that correlates the first subset of data to their Shapley values is trained to predict the Shapley values of a second subset of data that are unlabeled. A portion of the second subset of data may then be selected for labeling based on the predicted Shapley values.Type: GrantFiled: January 18, 2021Date of Patent: November 28, 2023Assignee: salesforce.com, inc.Inventors: Amirata Ghorbani, Carlos Andres Esteva
-
Patent number: 11829727Abstract: Approaches for cross-lingual regularization for multilingual generalization include a method for training a natural language processing (NLP) deep learning module. The method includes accessing a first dataset having a first training data entry, the first training data entry including one or more natural language input text strings in a first language; translating at least one of the one or more natural language input text strings of the first training data entry from the first language to a second language; creating a second training data entry by starting with the first training data entry and substituting the at least one of the natural language input text strings in the first language with the translation of the at least one of the natural language input text strings in the second language; adding the second training data entry to a second dataset; and training the deep learning module using the second dataset.Type: GrantFiled: April 23, 2021Date of Patent: November 28, 2023Assignee: salesforce.com, inc.Inventors: Jasdeep Singh, Nitish Shirish Keskar, Bryan McCann
-
Patent number: 11829721Abstract: Embodiments described herein provide dynamic blocking, a decoding algorithm which enables large-scale pretrained language models to generate high-quality paraphrases in an un-supervised setting. Specifically, in order to obtain an alternative surface form, when the language model emits a token that is present in the source sequence, the language model is prevented from generating the next token that is the same as the subsequent source token in the source sequence at the next time step. In this way, the language model is forced to generate a paraphrased sequence of the input source sequence, but with mostly different wording.Type: GrantFiled: January 28, 2021Date of Patent: November 28, 2023Assignee: salesforce.com, inc.Inventors: Tong Niu, Semih Yavuz, Yingbo Zhou, Nitish Shirish Keskar, Huan Wang, Caiming Xiong
-
Patent number: 11822897Abstract: Approaches for the translation of structured text include an embedding module for encoding and embedding source text in a first language, an encoder for encoding output of the embedding module, a decoder for iteratively decoding output of the encoder based on generated tokens in translated text from previous iterations, a beam module for constraining output of the decoder with respect to possible embedded tags to include in the translated text for a current iteration using a beam search, and a layer for selecting a token to be included in the translated text for the current iteration. The translated text is in a second language different from the first language. In some embodiments, the approach further includes scoring and pointer modules for selecting the token based on the output of the beam module or copied from the source text or reference text from a training pair best matching the source text.Type: GrantFiled: August 31, 2021Date of Patent: November 21, 2023Assignee: salesforce.com, inc.Inventors: Kazuma Hashimoto, Raffaella Buschiazzo, James Bradbury, Teresa Anna Marshall, Caiming Xiong, Richard Socher
-
Patent number: 11798534Abstract: Embodiments described herein provide an Adapt-and-Adjust (A2) mechanism for multilingual speech recognition model that combines both adaptation and adjustment methods as an integrated end-to-end training to improve the models' generalization and mitigate the long-tailed issue. Specifically, a multilingual language model mBERT is utilized, and converted into an autoregressive transformer decoder. In addition, a cross-attention module is added to the encoder on top of the mBERT's self-attention layer in order to explore the acoustic space in addition to the text space. The joint training of the encoder and mBERT decoder can bridge the semantic gap between the speech and the text.Type: GrantFiled: January 29, 2021Date of Patent: October 24, 2023Assignee: salesforce.com, inc.Inventors: Guangsen Wang, Chu Hong Hoi, Genta Indra Winata
-
Patent number: 11790184Abstract: Embodiments described herein provide natural language processing (NLP) systems and methods that provide a customized summarization of scientific or technical articles, which disentangles background information from new contributions, and summarizes the background information or the new information (or both) based on a user's preference. Specifically, the systems and methods utilize machine learning classifiers to classify portions of sentences within the article as containing background information or as containing a new contribution attributable to the article. The systems and methods then incorporate the background information in the summary or incorporate the new contribution in the summary and output the summary. In this way, the systems and methods can provide summaries of scientific literatures, which largely accelerates literature review in scientific fields.Type: GrantFiled: January 28, 2021Date of Patent: October 17, 2023Assignee: SALESFORCE.COM, INC.Inventors: Hiroaki Hayashi, Wojciech Kryscinski
-
Patent number: 11782686Abstract: Embodiments described herein a code generation and understanding model that builds on a Transformer-based encoder-decoder framework. The code generation and understanding model is configured to derive generic representations for programming language (PL) and natural language (NL) in code domain via pre-training on unlabeled code corpus, and then to benefit many code-related downstream tasks with fine-tuning. Apart from the denoising sequence-to-sequence objectives widely adopted for pre-training on natural language, identifier tagging and prediction pre-training objective is adopted to enable the model to better leverage the crucial token type information from PL, which specifically are the identifiers assigned by developers.Type: GrantFiled: August 27, 2021Date of Patent: October 10, 2023Assignee: SALESFORCE.COM, INC.Inventors: Yue Wang, Weishi Wang, Shafiq Rayhan Joty, Chu Hong Hoi
-
Patent number: 11783164Abstract: The technology disclosed provides a so-called “joint many-task neural network model” to solve a variety of increasingly complex natural language processing (NLP) tasks using growing depth of layers in a single end-to-end model. The model is successively trained by considering linguistic hierarchies, directly connecting word representations to all model layers, explicitly using predictions in lower tasks, and applying a so-called “successive regularization” technique to prevent catastrophic forgetting. Three examples of lower level model layers are part-of-speech (POS) tagging layer, chunking layer, and dependency parsing layer. Two examples of higher level model layers are semantic relatedness layer and textual entailment layer. The model achieves the state-of-the-art results on chunking, dependency parsing, semantic relatedness and textual entailment.Type: GrantFiled: October 26, 2020Date of Patent: October 10, 2023Assignee: Salesforce.com, Inc.Inventors: Kazuma Hashimoto, Caiming Xiong, Richard Socher
-
Patent number: 11776236Abstract: The system and method are directed to a prototypical contrastive learning (PCL). The PCL explicitly encodes the hierarchical semantic structure of the dataset into the learned embedding space and prevents the network from exploiting low-level cues for solving the unsupervised learning task. The PCL includes prototypes as the latent variables to help find the maximum-likelihood estimation of the network parameters in an expectation-maximization framework. The PCL iteratively performs an E-step for finding prototypes with clustering and M-step for optimizing the network on a contrastive loss.Type: GrantFiled: February 2, 2022Date of Patent: October 3, 2023Assignee: Salesforce.com, Inc.Inventors: Junnan Li, Chu Hong Hoi
-
Patent number: 11775775Abstract: Embodiments described herein provide a pipelined natural language question answering system that improves a BERT-based system. Specifically, the natural language question answering system uses a pipeline of neural networks each trained to perform a particular task. The context selection network identifies premium context from context for the question. The question type network identifies the natural language question as a yes, no, or span question and a yes or no answer to the natural language question when the question is a yes or no question. The span extraction model determines an answer span to the natural language question when the question is a span question.Type: GrantFiled: November 26, 2019Date of Patent: October 3, 2023Assignee: Salesforce.com, Inc.Inventors: Akari Asai, Kazuma Hashimoto, Richard Socher, Caiming Xiong
-
Publication number: 20230306008Abstract: Computing systems, database systems, and related methods are provided for managing data pertaining to electronic messages. A database system includes a database including a first object having a plurality of content fields corresponding to a first instance of an electronic message configured for a first communication channel and a server coupled to the database and a network to create a second database object corresponding to a second instance of the electronic message configured for a different communication channel, create a multichannel data structure in the database maintaining an association between the first and second objects, automatically populate a subset of content fields of the second object with values copied from the first object based on a mapping between the two communication channels, and thereafter use the second object to generate a version of the electronic message to be communicated to a recipient using the second communication channel.Type: ApplicationFiled: May 24, 2023Publication date: September 28, 2023Applicant: salesforce.com, inc.Inventors: Penny Tselikis, Thomas Besluau
-
Publication number: 20230308442Abstract: Devices, systems and methods are provided for remotely managing configuration of a networking device. One method of managing configuration of a networking device involves obtaining resources associated with one or more graphical user interface (GUI) displays of a web application for configuring the networking device, receiving a request to access the web application, and in response to the request, determining a representative value for the resources as a function of a current state of the resources, obtaining a reference value for the resources that reflects a validated state of the resources, and instantiating the web application using the resources when the representative value matches the reference value.Type: ApplicationFiled: March 28, 2022Publication date: September 28, 2023Applicant: Salesforce.com, Inc.Inventor: Jose Lejin P J
-
Publication number: 20230305698Abstract: A system for storing multiple petabytes of raw log data from cloud-based source data buckets into internal partitioned data storage in a data lake is provided. The system is configured to: deploy a log partitioner cluster comprising a plurality of log partitioner service instances for storing the raw log data in a partitioned manner; associate one or more of the source data buckets to each of a plurality of deployed log partitioner service instances by providing associated configuration metadata from a configuration service; fetch, via the log partitioner cluster, raw log data from associated source data buckets based on the associated configuration metadata, wherein the associated configuration metadata provides instructions for use by a log partitioner service instance to onboard raw log data; and place fetched raw log data, via the log partitioner cluster, in the internal partitioned log storage in accordance with the associated configuration metadata.Type: ApplicationFiled: March 28, 2022Publication date: September 28, 2023Applicant: salesforce.com, inc.Inventor: Lei Ye
-
Publication number: 20230300077Abstract: In some embodiments, a method stores an executable contract in a ledger. The executable contract is for a test to be performed on a network and includes terms for the test. The method receives an indication of traffic that is received at a network device. It is determined that the traffic is associated with the executable contract that is stored in the ledger based on the terms of the test. The method responds to the network device with a response indicating that the traffic is associated with the executable contract. The network device allows the traffic to be sent to a destination on the network to perform an operation for the test.Type: ApplicationFiled: March 16, 2022Publication date: September 21, 2023Applicant: salesforce.com, Inc.Inventors: Barrett Weisshaar, Luis Campo Giralte
-
Publication number: 20230297764Abstract: Disclosed herein are system, method, and computer program product embodiments for providing a non-obtrusive markup augmentation of websites during website localization. A method includes generating a localization descriptor for a localization element on a website, encoding the localization descriptor into unparsed syntax in code for the website, adding pseudo tag notations to the localization descriptor encoded in the unparsed syntax, extracting text that is being localized from the localization descriptor by identifying the pseudo tag notations, and creating a visual indicator for display as an overlay outside a visible area of the website to indicate a change in appearance of the website after localization, wherein a size of the visual indicator is scaled to match a size of the text as displayed in the visible area of the website when the code is rendered.Type: ApplicationFiled: March 15, 2022Publication date: September 21, 2023Applicant: salesforce.com, Inc.Inventor: Marcus Lenzner
-
Publication number: 20230297638Abstract: Disclosed herein are system, method, and computer program product embodiments for automatically detecting layout shifts or anomalies in a website during localization processes. A method includes generating a first rendering of a localization element on a website, extracting localized text from a localization descriptor for the localization element in code for the website, applying the localized text to the localization element on the website, generating a second rendering of the localization element on the website after the applying of the localized text, identifying an anomaly in the localization element displayed on the website by comparing the first and second renderings, and creating a visual indicator for display as an overlay outside a visible area of the website, wherein a location of the visual indicator corresponds to a location of the anomaly in the localization element displayed on the website.Type: ApplicationFiled: March 15, 2022Publication date: September 21, 2023Applicant: Salesforce.com, inc.Inventor: Marcus Lenzner
-
Patent number: 11765650Abstract: Techniques are disclosed relating to temporary networks of edge computing devices. In some embodiments, an edge computing device manages, in response to occurrence of an event, a temporary network of edge computing devices. Managing the temporary network includes: forming the temporary network, managing membership of edge computing devices in the temporary network, causing the temporary network to perform a specified set of tasks, transmitting, to a remote server computing system, information associated with the specified set of tasks, and causing disbanding of the temporary network after determining that the specified set of tasks is complete. The disclosed techniques for managing temporary networks of edge computing devices may advantageously reduce the overall use of a network's bandwidth by reducing the amount of communication with a remote server system.Type: GrantFiled: May 5, 2021Date of Patent: September 19, 2023Assignee: Salesforce.com, Inc.Inventors: Abhijit Sur, Rasmus Mencke
-
Publication number: 20230289339Abstract: A GraphQL server stores schemas for GraphQL APIs in a schema store, the schemas defining operations, types and fields of the APIs. Field resolvers are associated with the fields of the APIs to specify which of the data source is to be used to fulfill an API query request. When an event is detected that makes changes to a schema of any of the data sources, the schema store is accessed and an old field name is automatically replaced with a new field name in both the retrieved schemas and the field resolvers mapped to the retrieved schemas. Responsive to receiving an API request, it is determined whether the request has any field names that do not match the corresponding schema, and if so, the old field name is replaced with the new field name in the request.Type: ApplicationFiled: March 8, 2022Publication date: September 14, 2023Applicant: salesforce.com, inc.Inventor: Gaurav SAGAR
-
Publication number: 20230291736Abstract: Embodiments herein may relate to a technique for identification and verification of compliance with one or more pre-defined security policy sets for a network. Specifically, embodiments may include generation of an access control graph (ACG) that relates to the network. One or more paths of the ACG may be identified, and then compared against the pre-defined security policy sets. Other embodiments may be described or claimed.Type: ApplicationFiled: March 8, 2022Publication date: September 14, 2023Applicant: salesforce.com, inc.Inventors: Michal TREMBACZ, Gianstefano MONNI
-
Patent number: 11741142Abstract: Embodiments described herein provide document summarization systems and methods that utilize fine-tuning of pre-trained abstractive summarization models to produce summaries that more faithfully track the content of the documents. Such abstractive summarization models may be pre-trained using a corpus consisting of pairs of articles and associated summaries. For each article-summary pair, a pseudo label or control code is generated and represents a faithfulness of the summary with respect to the article. The pre-trained model is then fine-tuned based on the article-summary pairs and the corresponding control codes. The resulting fine-tuned models then provide improved faithfulness in document summarization tasks.Type: GrantFiled: January 31, 2022Date of Patent: August 29, 2023Assignee: salesforce.com, inc.Inventors: Haopeng Zheng, Semih Yavuz, Wojciech Kryscinski, Kazuma Hashimoto, Yingbo Zhou