Patents by Inventor Junaid Ahmed

Junaid Ahmed has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Performing targeted searching based on a user profile

Patent number: 11921728

Abstract: Aspects of the present disclosure relate to systems and methods for performing targeted searching based on a user profile. In examples, a user profile including a user embedding may be retrieved based on the receipt of a user indication. The user embedding may be created based on one or more user interests. A plurality of document embeddings may be identified based on the user embedding, where each document embedding of the plurality of document embeddings is determined to be within a first distance of the user embedding. In examples, a ranking for each document embedding of the plurality of document embeddings may be generated, where the ranking for each document embedding of the plurality of document embeddings is based on the user embedding. At least one document may be recommended based on a ranking associated with a document embedding.
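
The retrieval and ranking steps in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the use of cosine distance, and the choice to rank by that same distance are all assumptions made for illustration, since the abstract only says that documents lie within a first distance of the user embedding and are ranked based on it.

```python
import math


def cosine_distance(a, b):
    """Cosine distance between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def recommend(user_embedding, document_embeddings, max_distance, top_k=1):
    """Return up to top_k document ids, closest to the user embedding first.

    document_embeddings maps a hypothetical document id to its vector.
    """
    # Step 1: keep only documents within max_distance of the user embedding.
    candidates = []
    for doc_id, embedding in document_embeddings.items():
        distance = cosine_distance(user_embedding, embedding)
        if distance <= max_distance:
            candidates.append((doc_id, distance))

    # Step 2: rank the surviving documents (here simply by distance).
    candidates.sort(key=lambda pair: pair[1])
    return [doc_id for doc_id, _ in candidates[:top_k]]


user = [1.0, 0.0]
docs = {"a": [1.0, 0.1], "b": [0.0, 1.0], "c": [0.7, 0.7]}
top_documents = recommend(user, docs, max_distance=0.5, top_k=2)
```

In this toy data, document "b" is orthogonal to the user embedding and falls outside the distance threshold, so only "a" and "c" are ranked.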

Type: Grant

Filed: January 29, 2021

Date of Patent: March 5, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Junaid Ahmed, Waleed Malik, Arnold Overwijk
System and method for forecasting location of target in monocular first person view

Patent number: 11893751

Abstract: This disclosure relates generally to a system and method for forecasting the location of a target in a monocular first-person view. Conventional systems for location forecasting utilize complex neural networks and hence are computationally intensive and require high compute power. The disclosed system includes an efficient and lightweight RNN-based network model for predicting the motion of targets in first-person monocular videos. The network model includes an auto-encoder in the encoding phase and a regularizing layer at the end, which helps improve accuracy. The disclosed method relies entirely on detection bounding boxes for prediction as well as for training of the network model, and is still capable of zero-shot transfer to a different dataset.

Type: Grant

Filed: August 18, 2021

Date of Patent: February 6, 2024

Assignee: Tata Consultancy Services Limited

Inventors: Junaid Ahmed Ansari, Brojeshwar Bhowmick
Extracting key phrase candidates from documents and producing topical authority ranking

Patent number: 11874882

Abstract: A system for extracting key phrase candidates from a corpus of documents, including a processor, a memory, and a program executing on the processor. The system is configured to run a key phrase model to extract one or more key phrase candidates from each document in the corpus and convert each extracted key phrase candidate into a feature vector. The key phrase model also filters the feature vectors to remove duplicates using a classifier that was trained on a set of key phrase pairs with manual labels indicating whether two key phrases are duplicates of each other, to produce remaining key phrase candidates. The system uses the remaining key phrase candidates in a computer-implemented application.

Type: Grant

Filed: July 2, 2019

Date of Patent: January 16, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Li Xiong, Chuan Hu, Arnold Overwijk, Junaid Ahmed
Document body vectorization and noise-contrastive training

Patent number: 11829374

Abstract: Document embedding vectors for each document of a corpus may be generated by combining embedding vectors for document subparts, thereby yielding a final embedding vector for the document. A machine learning model is trained using a query corpus and the document corpus, where the model generates a ranking score for a given (query, document) pair. During training, ranking scores are generated using the model, such that the training dataset is further refined using the generated ranking scores. For example, top documents and a negative document may be determined for a given query and subsequently used as training data. Multiple negative documents may therefore be determined for a given query. A negative document for a given query may be determined from the negative documents using noise-contrastive estimation. Such determined negative documents may be evaluated using a loss function during model training, thereby yielding a more robust model for search processing.
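
As a rough illustration of the first two ideas in this abstract, the sketch below combines subpart embeddings into one document vector and then uses ranking scores to separate top documents from negative candidates. The mean-pooling combiner, the dot-product score, and all function names are assumptions; the abstract does not say how subparts are combined or scored, and the noise-contrastive selection step itself is not implemented here.

```python
def combine_subparts(subpart_embeddings):
    """Average the subpart vectors into one document vector (assumed combiner)."""
    count = len(subpart_embeddings)
    dim = len(subpart_embeddings[0])
    return [sum(vec[i] for vec in subpart_embeddings) / count for i in range(dim)]


def score(query_embedding, doc_embedding):
    """Dot-product ranking score, standing in for the trained model."""
    return sum(q * d for q, d in zip(query_embedding, doc_embedding))


def split_top_and_negatives(query_embedding, doc_embeddings, top_k):
    """Rank documents for a query; the top_k are positives, the rest negative candidates."""
    ranked = sorted(
        doc_embeddings,
        key=lambda doc_id: score(query_embedding, doc_embeddings[doc_id]),
        reverse=True,
    )
    return ranked[:top_k], ranked[top_k:]


# Hypothetical two-dimensional subpart embeddings for three documents.
corpus = {
    "d1": combine_subparts([[1.0, 0.0], [1.0, 0.0]]),
    "d2": combine_subparts([[0.0, 1.0], [1.0, 0.0]]),
    "d3": combine_subparts([[0.0, 1.0], [0.0, 1.0]]),
}
top_docs, negative_candidates = split_top_and_negatives([1.0, 0.0], corpus, top_k=1)
```

A training loop would then choose one or more negatives from `negative_candidates` and feed them to the loss function; that choice is where the abstract's noise-contrastive estimation would apply.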

Type: Grant

Filed: March 19, 2021

Date of Patent: November 28, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Junaid Ahmed, Li Xiong, Arnold Overwijk, Chenyan Xiong
INFERRING INFORMATION ABOUT A WEBPAGE BASED UPON A UNIFORM RESOURCE LOCATOR OF THE WEBPAGE

Publication number: 20230342410

Abstract: Described herein are technologies related to inferring information about a webpage based upon semantics of a uniform resource locator (URL) of the webpage. The URL is tokenized to create a sequence of tokens. An embedding for the URL is generated based upon the sequence of tokens, wherein the embedding is representative of semantics of the URL. Based upon the embedding for the URL, information about the webpage pointed to by the URL is inferred, the webpage is retrieved, and information is extracted from the webpage based upon the information inferred about the webpage.

Type: Application

Filed: June 30, 2023

Publication date: October 26, 2023

Inventors: Siarhei ALONICHAU, Aliaksei BONDARIONOK, Junaid AHMED
Automated structured textual content categorization accuracy with neural networks

Patent number: 11734559

Abstract: To provide automated categorization of structured textual content, individual nodes of textual content, from a document object model encapsulation of the structured textual content, have a multidimensional vector associated with them, where the values of the various dimensions of the multidimensional vector are based on the textual content in the corresponding node, the visual features applied to or associated with the textual content of the corresponding node, and positional information of the textual content of the corresponding node. The multidimensional vectors are input to a neighbor-imbuing neural network. The enhanced multidimensional vectors output by the neighbor-imbuing neural network are then provided to a categorization neural network. The resulting output can be in the form of multidimensional vectors whose dimensionality is proportional to the categories into which the structured textual content is to be categorized. A weighted merge takes into account multiple nodes that are grouped together.

Type: Grant

Filed: June 19, 2020

Date of Patent: August 22, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Charumathi Lakshmanan, Ye Li, Arnold Overwijk, Chenyan Xiong, Jiguang Shen, Junaid Ahmed, Jiaming Guo
Inferring information about a webpage based upon a uniform resource locator of the webpage

Patent number: 11727077

Abstract: Described herein are technologies related to inferring information about a webpage based upon semantics of a uniform resource locator (URL) of the webpage. The URL is tokenized to create a sequence of tokens. An embedding for the URL is generated based upon the sequence of tokens, wherein the embedding is representative of semantics of the URL. Based upon the embedding for the URL, information about the webpage pointed to by the URL is inferred, the webpage is retrieved, and information is extracted from the webpage based upon the information inferred about the webpage.

Type: Grant

Filed: February 5, 2021

Date of Patent: August 15, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Siarhei Alonichau, Aliaksei Bondarionok, Junaid Ahmed
Keyphrase extraction beyond language modeling

Patent number: 11657223

Abstract: A system for extracting a key phrase from a document includes a neural key phrase extraction model (“BLING-KPE”) having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset.

Type: Grant

Filed: December 16, 2021

Date of Patent: May 23, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Li Xiong, Chuan Hu, Arnold Overwijk, Junaid Ahmed, Daniel Fernando Campos, Chenyan Xiong
Constructing a computer-implemented semantic document

Patent number: 11562593

Abstract: Technologies pertaining to electronic document understanding are described herein. A document is received, wherein the document includes a section of a type. An image of the document is generated, and a candidate region is identified in the image of the document, wherein the candidate region encompasses the section. A label is assigned to the candidate region based upon text of the section, wherein the label identifies the type of the section. An electronic document understanding task is performed based upon the label assigned to the candidate region.

Type: Grant

Filed: May 29, 2020

Date of Patent: January 24, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziliu Li, Junaid Ahmed, Kwok Fung Tang, Arnold Overwijk, Jue Wang, Charumathi Lakshmanan, Arindam Mitra
In-place encryption of a swap file on a host machine

Patent number: 11455182

Abstract: Systems and methods are described for encrypting a swap file in a computer system. The swap file can be encrypted by a background process executing on the computer system. Processing of page swapping operations occurs independently and separately of the background encryption of the swap file. Processing a page swapping operation can include decrypting or encrypting the data to be swapped, depending on the paging operation and on whether the data to be swapped is encrypted.

Type: Grant

Filed: May 3, 2019

Date of Patent: September 27, 2022

Assignee: VMware, Inc.

Inventors: Ishan Banerjee, Preeti Agarwal, Valeriy Zhuravlev, Nick M Ryan, Mohammed Junaid Ahmed
TOKENIZING ALPHANUMERIC TEXT THROUGH USE OF FINITE STATE MACHINES

Publication number: 20220284190

Abstract: Described herein are technologies related to tokenizing alphanumeric text through use of a tokenization algorithm that is at least partially implemented as a finite state machine. The tokenization algorithm is configured to output numeric identifiers that represent tokens or sub-tokens in the alphanumeric text.
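
A minimal sketch of the general idea, a tokenizer driven by explicit states that emits numeric identifiers, might look as follows. It is an illustration only: the three states, the lowercasing, the split between letter runs and digit runs, the vocabulary, and the unknown-token id are all assumptions, and sub-token handling is not covered.

```python
def tokenize(text, vocab, unknown_id=0):
    """Split text into letter runs and digit runs and map each to a numeric id.

    The machine has three states: START (between tokens), WORD, and NUMBER.
    vocab is a hypothetical mapping from token strings to ids.
    """
    ids = []
    current = []
    state = "START"

    def emit():
        # Flush the characters collected so far as one token id.
        if current:
            ids.append(vocab.get("".join(current), unknown_id))
            current.clear()

    for ch in text:
        # The next state is determined by the class of the input character.
        if ch.isalpha():
            next_state = "WORD"
        elif ch.isdigit():
            next_state = "NUMBER"
        else:
            next_state = "START"

        # Any state change closes the token in progress.
        if next_state != state:
            emit()
            state = next_state

        if state != "START":
            current.append(ch.lower())

    emit()  # flush a token that runs to the end of the input
    return ids


vocabulary = {"abc": 1, "123": 2, "x": 3}
token_ids = tokenize("abc123 x!", vocabulary)
```

Because every transition depends only on the current state and the current character, the loop makes a single pass over the input with no backtracking.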

Type: Application

Filed: March 2, 2021

Publication date: September 8, 2022

Inventors: Siarhei ALONICHAU, Junaid AHMED
Online disk encryption using mirror driver

Patent number: 11436034

Abstract: Provided are techniques for encrypting a virtual disk of a virtual computing instance (VCI) while the VCI is online and still running using a mirror driver. In certain aspects a mirror driver is a filter running in an I/O stack used for accessing a virtual disk, such that the mirror driver receives I/Os destined to the virtual disk and mirrors those I/Os to the virtual disk and one or more additional virtual disks. The mirror driver begins copying data from an unencrypted source virtual disk to a destination virtual disk, and the data is encrypted as it is stored in the destination virtual disk, while the VCI is still online. During the copying, as new writes are issued to the unencrypted source virtual disk from the VCI, the mirror driver mirrors the writes to both the unencrypted source virtual disk and the destination virtual disk.

Type: Grant

Filed: November 13, 2019

Date of Patent: September 6, 2022

Assignee: VMWARE, INC.

Inventor: Mohammed Junaid Ahmed
INFERRING INFORMATION ABOUT A WEBPAGE BASED UPON A UNIFORM RESOURCE LOCATOR OF THE WEBPAGE

Publication number: 20220253502

Abstract: Described herein are technologies related to inferring information about a webpage based upon semantics of a uniform resource locator (URL) of the webpage. The URL is tokenized to create a sequence of tokens. An embedding for the URL is generated based upon the sequence of tokens, wherein the embedding is representative of semantics of the URL. Based upon the embedding for the URL, information about the webpage pointed to by the URL is inferred, the webpage is retrieved, and information is extracted from the webpage based upon the information inferred about the webpage.

Type: Application

Filed: February 5, 2021

Publication date: August 11, 2022

Inventors: Siarhei ALONICHAU, Aliaksei BONDARIONOK, Junaid AHMED
PERFORMING TARGETED SEARCHING BASED ON A USER PROFILE

Publication number: 20220245161

Abstract: Aspects of the present disclosure relate to systems and methods for performing targeted searching based on a user profile. In examples, a user profile including a user embedding may be retrieved based on the receipt of a user indication. The user embedding may be created based on one or more user interests. A plurality of document embeddings may be identified based on the user embedding, where each document embedding of the plurality of document embeddings is determined to be within a first distance of the user embedding. In examples, a ranking for each document embedding of the plurality of document embeddings may be generated, where the ranking for each document embedding of the plurality of document embeddings is based on the user embedding. At least one document may be recommended based on a ranking associated with a document embedding.

Type: Application

Filed: January 29, 2021

Publication date: August 4, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Junaid AHMED, Waleed MALIK, Arnold OVERWIJK
Generating a graph data structure that identifies relationships among topics expressed in web documents

Patent number: 11361028

Abstract: A technique produces a graph data structure based on at least partially unstructured information dispersed over web documents. The technique involves applying a machine-trained model to a set of documents (or, more generally, “document units”) to identify topics in the documents. The technique then generates count information by counting the occurrences of single topics and the co-occurrences of pairings of topics in the documents. The technique generates conditional probability information based on the count information. An instance of conditional probability information describes a probability that a first topic will occur, given an appearance of a second topic, and a probability that the second topic will occur, given an appearance of the first topic. The technique then formulates the conditional probability information in a graph data structure. The technique also provides an application system that utilizes the graph data structure to provide any kind of computer-implemented service to a user.
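
The counting and conditional-probability steps can be sketched in a few lines. This is an illustrative reading of the abstract, not the patented method: topic identification by a machine-trained model is replaced by hand-written topic lists, and the nested-dictionary graph layout and function name are assumptions.

```python
from collections import Counter
from itertools import combinations


def build_topic_graph(documents_topics):
    """Build graph[given][target] = P(target appears | given appears).

    documents_topics is a list of topic lists, one per document; in the
    abstract these topics would come from a machine-trained model.
    """
    topic_counts = Counter()
    pair_counts = Counter()
    for topics in documents_topics:
        unique = sorted(set(topics))  # count each topic once per document
        topic_counts.update(unique)
        pair_counts.update(combinations(unique, 2))

    graph = {}
    for (first, second), together in pair_counts.items():
        # One directed edge per direction of the conditional probability.
        graph.setdefault(second, {})[first] = together / topic_counts[second]
        graph.setdefault(first, {})[second] = together / topic_counts[first]
    return graph


documents = [
    ["python", "pandas"],
    ["python", "numpy"],
    ["python", "pandas", "numpy"],
    ["python"],
]
topic_graph = build_topic_graph(documents)
```

With this toy input, `topic_graph["pandas"]["python"]` is 1.0 because every document mentioning pandas also mentions python, while `topic_graph["python"]["pandas"]` is 0.5, which shows why the two directions are stored separately.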

Type: Grant

Filed: June 9, 2020

Date of Patent: June 14, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Ziliu Li, Junaid Ahmed, Arnold Overwijk, Li Xiong, Xiao Liu
DOCUMENT BODY VECTORIZATION AND NOISE-CONTRASTIVE TRAINING

Publication number: 20220179871

Abstract: Document embedding vectors for each document of a corpus may be generated by combining embedding vectors for document subparts, thereby yielding a final embedding vector for the document. A machine learning model is trained using a query corpus and the document corpus, where the model generates a ranking score for a given (query, document) pair. During training, ranking scores are generated using the model, such that the training dataset is further refined using the generated ranking scores. For example, top documents and a negative document may be determined for a given query and subsequently used as training data. Multiple negative documents may therefore be determined for a given query. A negative document for a given query may be determined from the negative documents using noise-contrastive estimation. Such determined negative documents may be evaluated using a loss function during model training, thereby yielding a more robust model for search processing.

Type: Application

Filed: March 19, 2021

Publication date: June 9, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Junaid AHMED, Li XIONG, Arnold OVERWIJK, Chenyan XIONG
KEYPHRASE EXTRACTION BEYOND LANGUAGE MODELING

Publication number: 20220108078

Abstract: A system for extracting a key phrase from a document includes a neural key phrase extraction model (“BLING-KPE”) having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset.

Type: Application

Filed: December 16, 2021

Publication date: April 7, 2022

Inventors: Li XIONG, Chuan HU, Arnold OVERWIJK, Junaid AHMED, Daniel Fernando CAMPOS, Chenyan XIONG
SYSTEM AND METHOD FOR FORECASTING LOCATION OF TARGET IN MONOCULAR FIRST PERSON VIEW

Publication number: 20220076431

Abstract: This disclosure relates generally to a system and method for forecasting the location of a target in a monocular first-person view. Conventional systems for location forecasting utilize complex neural networks and hence are computationally intensive and require high compute power. The disclosed system includes an efficient and lightweight RNN-based network model for predicting the motion of targets in first-person monocular videos. The network model includes an auto-encoder in the encoding phase and a regularizing layer at the end, which helps improve accuracy. The disclosed method relies entirely on detection bounding boxes for prediction as well as for training of the network model, and is still capable of zero-shot transfer to a different dataset.

Type: Application

Filed: August 18, 2021

Publication date: March 10, 2022

Applicant: Tata Consultancy Services Limited

Inventors: Junaid Ahmed ANSARI, Brojeshwar Bhowmick
Ranking computer-implemented search results based upon static scores assigned to webpages

Patent number: 11263225

Abstract: Technologies pertaining to ranking webpages in response to receipt of a query are described. A search engine receives a query and identifies webpages that are germane to the query. The search engine ranks the identified webpages to form a ranked list, wherein a first webpage is positioned in the ranked list based upon a static score assigned to the first webpage. The static score is based upon a weight assigned to a hyperlink in a second webpage, wherein the hyperlink points to the first webpage, and further wherein the weight is based upon a value of a feature of the hyperlink, such as a location of the hyperlink on the second webpage when the second webpage is rendered. Further, the second webpage includes several hyperlinks that point to different webpages, wherein each of the several hyperlinks has a different weight assigned thereto.
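
One way to picture the static score is as a sum of per-hyperlink weights, where each weight depends on a feature of the link such as its location on the linking page. The sketch below is a hedged illustration: the location categories, the weight values, the additive scoring, and the function names are all hypothetical and are not taken from the patent.

```python
# Hypothetical weights keyed by where the hyperlink appears when rendered.
LOCATION_WEIGHTS = {"body": 1.0, "sidebar": 0.4, "footer": 0.1}


def static_scores(links):
    """Sum the weights of inbound hyperlinks for each target page.

    links is a list of (source_page, target_page, location) tuples.
    """
    scores = {}
    for _source, target, location in links:
        weight = LOCATION_WEIGHTS.get(location, 0.0)
        scores[target] = scores.get(target, 0.0) + weight
    return scores


def rank_by_static_score(candidates, scores):
    """Order query-relevant candidate pages by static score, highest first."""
    return sorted(candidates, key=lambda page: scores.get(page, 0.0), reverse=True)


links = [
    ("hub", "a", "body"),
    ("hub", "b", "footer"),
    ("blog", "b", "sidebar"),
    ("blog", "c", "footer"),
]
scores = static_scores(links)
ranked_pages = rank_by_static_score(["c", "b", "a"], scores)
```

Here the page "hub" contains two hyperlinks with different weights, matching the abstract's point that links on the same page need not contribute equally. A production ranker would presumably combine the static score with query-dependent signals rather than sort on it alone.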

Type: Grant

Filed: May 19, 2020

Date of Patent: March 1, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Ziliu Li, Junaid Ahmed, Arnold Overwijk, Li Xiong
Keyphrase extraction beyond language modeling

Patent number: 11250214

Abstract: A system for extracting a key phrase from a document includes a neural key phrase extraction model (“BLING-KPE”) having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset.

Type: Grant

Filed: July 2, 2019

Date of Patent: February 15, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Li Xiong, Chuan Hu, Arnold Overwijk, Junaid Ahmed, Daniel Fernando Campos, Chenyan Xiong
