Patents by Inventor Alexander Sim

Alexander Sim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250124227
    Abstract: A personalized natural language processing system tokenizes a plurality of sets of raw text data to generate a plurality of sets of tokenized text data for the plurality of users, respectively. The tokenized text data includes a sequence of tokens corresponding to the raw text data, the tokens at least identifying distinct words or portions of words in the raw text. The system appends predetermined user-specific tokens to the sets of tokenized text data from the users, respectively. Each predetermined user-specific token corresponds to one of the users. The system processes the sets of tokenized text data using the NLP model in accordance with the appended predetermined user-specific tokens to predict a personalized classification for the sets of tokenized text data from each of the users, and outputs the personalized classifications of the tokenized text data for each of the users.
    Type: Application
    Filed: December 23, 2024
    Publication date: April 17, 2025
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Dimitrios Basile DIMITRIADIS, Vaishnavi SHRIVASTAVA, Milad SHOKOUHI, Robert Alexander SIM, Fatemehsadat MIRESHGHALLAH
  • Publication number: 20250053445
    Abstract: The present disclosure relates to systems and methods for an interactive, intelligent hub built around the completion of a task. This hub brings together resources, information, suggested steps, and other automated assistance to facilitate the completion of the task. AI-based assistance may indicate which steps can be completed by automated processes, and dispatch those processes, or suggest resources to assist in the completion of other steps. The hub displays the current status of the task, and lives until the completion of the task, or abandonment by the user.
    Type: Application
    Filed: October 28, 2024
    Publication date: February 13, 2025
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Robert Alexander SIM, Ryen William WHITE, Omar SHAYA, Bernd Ingo PLONTSCH, Elnaz NOURI
  • Patent number: 12182511
    Abstract: A personalized natural language processing system tokenizes a plurality of sets of raw text data to generate a plurality of sets of tokenized text data for the plurality of users, respectively. The tokenized text data includes a sequence of tokens corresponding to the raw text data, the tokens at least identifying distinct words or portions of words in the raw text. The system appends predetermined user-specific tokens to the sets of tokenized text data from the users, respectively. Each predetermined user-specific token corresponds to one of the users. The system processes the sets of tokenized text data using the NLP model in accordance with the appended predetermined user-specific tokens to predict a personalized classification for the sets of tokenized text data from each of the users, and outputs the personalized classifications of the tokenized text data for each of the users.
    Type: Grant
    Filed: March 16, 2022
    Date of Patent: December 31, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Dimitrios Basile Dimitriadis, Vaishnavi Shrivastava, Milad Shokouhi, Robert Alexander Sim, Fatemehsadat Mireshghallah
  • Patent number: 12153956
    Abstract: The present disclosure relates to systems and methods for an interactive, intelligent hub built around the completion of a task. This hub brings together resources, information, suggested steps, and other automated assistance to facilitate the completion of the task. AI-based assistance may indicate which steps can be completed by automated processes, and dispatch those processes, or suggest resources to assist in the completion of other steps. The hub displays the current status of the task, and lives until the completion of the task, or abandonment by the user.
    Type: Grant
    Filed: July 18, 2022
    Date of Patent: November 26, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Robert Alexander Sim, Ryen William White, Omar Shaya, Bernd Ingo Plontsch, Elnaz Nouri
  • Publication number: 20240354713
    Abstract: A system and method to provide computer support for a meeting of invitees comprises accessing one or more sensory data streams providing digitized sensory data responsive to an activity of one or more of the invitees during the meeting, the one or more sensory data streams including at least one audio stream. The method also comprises subjecting the at least one audio stream to phonetic and situational computer modeling to recognize a sequence of words in the audio stream and to assign each word to an invitee, subjecting the sequence of words to semantic computer modeling to recognize a sequence of directives in the sequence of words, and releasing one or more output data streams based on the sequence of directives, the one or more output data streams including one or more notifications.
    Type: Application
    Filed: July 3, 2024
    Publication date: October 24, 2024
    Inventors: Robert Alexander SIM, Marcello MENDES HASEGAWA, Ryen William WHITE, Mudit JAIN, Tomer HERMELIN, Adi GERZI ROSENTHAL, Sagi HILLELI
  • Patent number: 12051046
    Abstract: A system and method to provide computer support for a meeting of invitees comprises accessing one or more sensory data streams providing digitized sensory data responsive to an activity of one or more of the invitees during the meeting, the one or more sensory data streams including at least one audio stream. The method also comprises subjecting the at least one audio stream to phonetic and situational computer modeling to recognize a sequence of words in the audio stream and to assign each word to an invitee, subjecting the sequence of words to semantic computer modeling to recognize a sequence of directives in the sequence of words, and releasing one or more output data streams based on the sequence of directives, the one or more output data streams including one or more notifications.
    Type: Grant
    Filed: August 5, 2021
    Date of Patent: July 30, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Robert Alexander Sim, Marcello Mendes Hasegawa, Ryen William White, Mudit Jain, Tomer Hermelin, Adi Gerzi Rosenthal, Sagi Hilleli
  • Publication number: 20240232405
    Abstract: Systems and methods are directed to building annotated models based on eyes-off data. Specifically, a synthetic data generation model is trained and used to further train a target model. The synthetic data generation model is trained within an eyes-off environment using an anonymity technique on confidential data. The synthetic data generation model is then used to create synthetic data that closely represents the confidential data but without any specific details that can be linked back to the confidential data. The synthetic data is then annotated and used to train the target model within an eyes-on environment. Subsequently, the target model is deployed back within the eyes-off environment to classify the confidential data.
    Type: Application
    Filed: October 24, 2022
    Publication date: July 11, 2024
    Inventors: David Benjamin LEVITAN, Robert Alexander SIM, Julia S. MCANALLEN, Huseyin Atahan INAN, Girish KUMAR, Xiang YUE
  • Publication number: 20240135015
    Abstract: Systems and methods are directed to building annotated models based on eyes-off data. Specifically, a synthetic data generation model is trained and used to further train a target model. The synthetic data generation model is trained within an eyes-off environment using an anonymity technique on confidential data. The synthetic data generation model is then used to create synthetic data that closely represents the confidential data but without any specific details that can be linked back to the confidential data. The synthetic data is then annotated and used to train the target model within an eyes-on environment. Subsequently, the target model is deployed back within the eyes-off environment to classify the confidential data.
    Type: Application
    Filed: October 23, 2022
    Publication date: April 25, 2024
    Inventors: David Benjamin LEVITAN, Robert Alexander SIM, Julia S. MCANALLEN, Huseyin Atahan INAN, Girish KUMAR, Xiang YUE
  • Publication number: 20230306313
    Abstract: Examples of ensemble knowledge transfer in collaborative learning include: receiving, at a primary node, from a plurality of remote nodes, a plurality of trained proxy machine learning (ML) models, wherein each proxy ML model is received from a different one of the plurality of remote nodes, and wherein each of the plurality of remote nodes is remote across a network from the primary node; training a primary ML model using the plurality of proxy ML models, wherein training the primary ML model comprises: for each of a plurality of training cases of a primary training dataset, weighting results from each of the proxy ML models based on at least a confidence of the respective proxy ML model regarding the training case.
    Type: Application
    Filed: March 22, 2022
    Publication date: September 28, 2023
    Inventors: Dimitrios Basile DIMITRIADIS, Antonio Andre MONTEIRO MANOEL, Robert Alexander SIM, Yae Jee CHO
  • Publication number: 20230297777
    Abstract: A personalized natural language processing system tokenizes a plurality of sets of raw text data to generate a plurality of sets of tokenized text data for the plurality of users, respectively. The tokenized text data includes a sequence of tokens corresponding to the raw text data, the tokens at least identifying distinct words or portions of words in the raw text. The system appends predetermined user-specific tokens to the sets of tokenized text data from the users, respectively. Each predetermined user-specific token corresponds to one of the users. The system processes the sets of tokenized text data using the NLP model in accordance with the appended predetermined user-specific tokens to predict a personalized classification for the sets of tokenized text data from each of the users, and outputs the personalized classifications of the tokenized text data for each of the users.
    Type: Application
    Filed: March 16, 2022
    Publication date: September 21, 2023
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Dimitrios Basile DIMITRIADIS, Vaishnavi SHRIVASTAVA, Milad SHOKOUHI, Robert Alexander SIM, Fatemehsadat MIRESHGHALLAH
  • Publication number: 20220350654
    Abstract: The present disclosure relates to systems and methods for an interactive, intelligent hub built around the completion of a task. This hub brings together resources, information, suggested steps, and other automated assistance to facilitate the completion of the task. AI-based assistance may indicate which steps can be completed by automated processes, and dispatch those processes, or suggest resources to assist in the completion of other steps. The hub displays the current status of the task, and lives until the completion of the task, or abandonment by the user.
    Type: Application
    Filed: July 18, 2022
    Publication date: November 3, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Robert Alexander SIM, Ryen William WHITE, Omar SHAYA, Bernd Ingo PLONTSCH, Elnaz NOURI
  • Patent number: 11461406
    Abstract: A system, computer implemented method, and computer storage medium encoded with a computer program, for identifying newly trending topics in a data stream. An example method includes: receiving text documents forming part of a data stream from one or more servers; identifying terms within the received text documents; deriving from the identified terms, a set of terms identified as abnormal by virtue of having a relatively high frequency of occurrence within the text documents received in a recent period compared with that expected from their historic occurrence; creating a first set of one or more clusters, each cluster including a group of terms from the set of terms identified as abnormal which through their degree of co-occurrence in the received text documents are considered to relate to the same topic; and comparing clusters of a further set with the clusters of the first set to determine whether a cluster of the further set pertains to the same topic.
    Type: Grant
    Filed: December 23, 2019
    Date of Patent: October 4, 2022
    Assignee: Verint Systems UK Limited
    Inventors: David Andrew Roberts, David Alexander Sim
  • Patent number: 11416290
    Abstract: The present disclosure relates to systems and methods for an interactive, intelligent hub built around the completion of a task. This hub brings together resources, information, suggested steps, and other automated assistance to facilitate the completion of the task. AI-based assistance may indicate which steps can be completed by automated processes, and dispatch those processes, or suggest resources to assist in the completion of other steps. The hub displays the current status of the task, and lives until the completion of the task, or abandonment by the user.
    Type: Grant
    Filed: May 28, 2020
    Date of Patent: August 16, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Robert Alexander Sim, Ryen William White, Omar Shaya, Bernd Ingo Plontsch, Elnaz Nouri
  • Publication number: 20210373943
    Abstract: The present disclosure relates to systems and methods for an interactive, intelligent hub built around the completion of a task. This hub brings together resources, information, suggested steps, and other automated assistance to facilitate the completion of the task. AI-based assistance may indicate which steps can be completed by automated processes, and dispatch those processes, or suggest resources to assist in the completion of other steps. The hub displays the current status of the task, and lives until the completion of the task, or abandonment by the user.
    Type: Application
    Filed: May 28, 2020
    Publication date: December 2, 2021
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Robert Alexander SIM, Ryen William WHITE, Omar SHAYA, Bernd Ingo PLONTSCH, Elnaz NOURI
  • Publication number: 20210365895
    Abstract: A system and method to provide computer support for a meeting of invitees comprises accessing one or more sensory data streams providing digitized sensory data responsive to an activity of one or more of the invitees during the meeting, the one or more sensory data streams including at least one audio stream. The method also comprises subjecting the at least one audio stream to phonetic and situational computer modeling to recognize a sequence of words in the audio stream and to assign each word to an invitee, subjecting the sequence of words to semantic computer modeling to recognize a sequence of directives in the sequence of words, and releasing one or more output data streams based on the sequence of directives, the one or more output data streams including one or more notifications.
    Type: Application
    Filed: August 5, 2021
    Publication date: November 25, 2021
    Inventors: Robert Alexander SIM, Marcello MENDES HASEGAWA, Ryen William WHITE, Mudit JAIN, Tomer HERMELIN, Adi GERZI ROSENTHAL, Sagi HILLELI
  • Patent number: 11170759
    Abstract: A method, system, and computer program product for discriminating boilerplate text in documents, such as web pages. An example method includes: receiving documents structured as labelled text elements; generating a local language model for each labelled text element of the received documents; comparing local language models for different labelled text elements that have the same label; for each comparison of local language models, deriving a similarity indicator, and using the similarity indicators of all the comparisons to derive a similarity score for that label; using the similarity scores to determine labels associated with text elements comprising boilerplate text; and providing the textual content of the labelled text elements to a receiving computer system; and identifying the textual content of labelled text elements that include boilerplate text.
    Type: Grant
    Filed: December 23, 2019
    Date of Patent: November 9, 2021
    Assignee: Verint Systems UK Limited
    Inventors: David Alexander Sim, David Paul Austen Ryland
  • Patent number: 11113672
    Abstract: A system and method to provide computer support for a meeting of invitees comprises accessing one or more sensory data streams providing digitized sensory data responsive to an activity of one or more of the invitees during the meeting, the one or more sensory data streams including at least one audio stream. The method also comprises subjecting the at least one audio stream to phonetic and situational computer modeling to recognize a sequence of words in the audio stream and to assign each word to an invitee, subjecting the sequence of words to semantic computer modeling to recognize a sequence of directives in the sequence of words, and releasing one or more output data streams based on the sequence of directives, the one or more output data streams including one or more notifications.
    Type: Grant
    Filed: March 22, 2018
    Date of Patent: September 7, 2021
    Inventors: Robert Alexander Sim, Marcello Mendes Hasegawa, Ryen William White, Mudit Jain, Tomer Hermelin, Adi Gerzi Rosenthal, Sagi Hilleli
  • Publication number: 20210224324
    Abstract: The present disclosure relates to systems and methods for discovering relatedness between entities from a corpora of information by automatically extracting attributes from the plurality of heterogeneous entities in a graph. A standardized representation of the extracted attributes from the plurality of heterogeneous entities are propagated across the graph and these propagated attributes are used to find a degree to which the plurality of heterogeneous entities are associated with the extracted attributes. The degree to which the plurality of heterogeneous entities are associated with the extracted attributes is used to create a representation space illustrating a level of relatedness of an entity to another entity of the plurality of heterogeneous entities.
    Type: Application
    Filed: February 3, 2020
    Publication date: July 22, 2021
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Adam FOURNEY, Robert Alexander SIM, Shane Frandon WILLIAMS, Paul Nathan BENNETT, Tara Lynn SAFAVI
  • Publication number: 20200236147
    Abstract: Embodiments described herein are implemented in authentication brokering systems where an authentication broker issues security tokens that represent its authentications of users. Client devices operated by the users store the security tokens and send them to resource providers. The resource providers authenticate and grant access to the users based on validation of the security tokens. Authentication related messages exchanged between the resource providers and the authentication broker are used to exchange authentication risk data that is obtained or derived by the resource providers and the authentication broker. The resource providers obtain authentication risk data directly from the authentication broker and indirectly, via the authentication broker, from each other. As security tokens are used or managed, authentication risk data is shared among the participants in the authentication brokering system.
    Type: Application
    Filed: April 6, 2020
    Publication date: July 23, 2020
    Inventors: Robert Alexander SIM, Akash Atul SHAH, Jisheng LIANG
  • Publication number: 20200218761
    Abstract: A system, computer implemented method, and computer storage medium encoded with a computer program, for identifying newly trending topics in a data stream. An example method includes: receiving text documents forming part of a data stream from one or more servers; identifying terms within the received text documents; deriving from the identified terms, a set of terms identified as abnormal by virtue of having a relatively high frequency of occurrence within the text documents received in a recent period compared with that expected from their historic occurrence; creating a first set of one or more clusters, each cluster including a group of terms from the set of terms identified as abnormal which through their degree of co-occurrence in the received text documents are considered to relate to the same topic; and comparing clusters of a further set with the clusters of the first set to determine whether a cluster of the further set pertains to the same topic.
    Type: Application
    Filed: December 23, 2019
    Publication date: July 9, 2020
    Inventors: David Andrew ROBERTS, David Alexander SIM