Patents by Inventor Junaid Ahmed
Junaid Ahmed has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11921728Abstract: Aspects of the present disclosure relate to systems and methods for performing targeted searching based on a user profile. In examples, a user profile including a user embedding may be retrieved based on the receipt of a user indication. The user embedding may be created based on one or more user interest. A plurality of document embeddings may be identified based on the user embedding, where each document embedding of the plurality of document embeddings is determined to be within a first distance of the user embedding. In examples, a ranking for each document embedding of the plurality of document embeddings may be generated, where the ranking for each document embedding of the plurality of document embeddings is based on the user embedding. At least one document may be recommend based on a ranking associated with a document embedding.Type: GrantFiled: January 29, 2021Date of Patent: March 5, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Junaid Ahmed, Waleed Malik, Arnold Overwijk
-
Patent number: 11893751Abstract: This disclosure relates generally to system and method for forecasting location of target in monocular first person view. Conventional systems for location forecasting utilizes complex neural networks and hence are computationally intensive and requires high compute power. The disclosed system includes an efficient and light-weight RNN based network model for predicting motion of targets in first person monocular videos. The network model includes an auto-encoder in the encoding phase and a regularizing layer in the end helps us get better accuracy. The disclosed method relies entirely just on detection bounding boxes for prediction as well as training of the network model and is still capable of transferring zero-shot on a different dataset.Type: GrantFiled: August 18, 2021Date of Patent: February 6, 2024Assignee: Tata Consultancy Services LimitedInventors: Junaid Ahmed Ansari, Brojeshwar Bhowmick
-
Patent number: 11874882Abstract: A system for extracting key phrase candidates from a corpus of documents, including a processor, a memory, and a program executing on the processor. The system is configured to run a key phrase model to extract one or more key phrase candidates from each document in the corpus and convert each extracted key phrase candidate into a feature vector. The key phrase model also filters the feature vectors to remove duplicates using a classifier that was trained on a set of key phrase pairs with manual labels indicating whether two key phrases are duplicates of each other, to produce remaining key phrase candidates. The system uses the remaining key phrase candidates in a computer-implemented application.Type: GrantFiled: July 2, 2019Date of Patent: January 16, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Li Xiong, Chuan Hu, Arnold Overwijk, Junaid Ahmed
-
Patent number: 11829374Abstract: Document embedding vectors for each document of a corpus may be generated by combining embedding vectors for document subparts, thereby yielding a final embedding vector for the document. A machine learning model is trained using a query corpus and the document corpus, where the model generates a ranking score for a given (query, document) pair. During training, rankings scores are generated using the model, such that the training dataset is further refined using the generated ranking scores. For example, top documents and a negative document may be determined for a given query and subsequently used as training data. Multiple negative documents may therefore be determined for a given query. A negative document for a given query may be determined from the negative documents using noise-contrastive estimation. Such determined negative documents may be evaluated using a loss function during model training, thereby yielding a more robust model for search processing.Type: GrantFiled: March 19, 2021Date of Patent: November 28, 2023Assignee: Microsoft Technology Licensing, LLCInventors: Junaid Ahmed, Li Xiong, Arnold Overwijk, Chenyan Xiong
-
Publication number: 20230342410Abstract: Described herein are technologies related to inferring information about a webpage based upon semantics of a uniform resource location (URL) of the webpage. The URL is tokenized to create a sequence of tokens. An embedding for the URL is generated based upon the sequence of tokens, wherein the embedding is representative of semantics of the URL. Based upon the embedding for the URL, information about the webpage pointed to by the URL is inferred, the webpage is retrieved, and information is extracted from the webpage based upon the information inferred about the webpage.Type: ApplicationFiled: June 30, 2023Publication date: October 26, 2023Inventors: Siarhei ALONICHAU, Aliaksei BONDARIONOK, Junaid AHMED
-
Patent number: 11734559Abstract: To provide automated categorization of structured textual content individual nodes of textual content, from a document object model encapsulation of the structured textual content, have a multidimensional vector associated with them, where the values of the various dimensions of the multidimensional vector are based on the textual content in the corresponding node, the visual features applied or associated with the textual content of the corresponding node, and positional information of the textual content of the corresponding node. The multidimensional vectors are input to a neighbor-imbuing neural network. The enhanced multidimensional vectors output by the neighbor-imbuing neural network are then be provided to a categorization neural network. The resulting output can be in the form of multidimensional vectors whose dimensionality is proportional to categories into which the structured textual content is to be categorized. A weighted merge takes into account multiple nodes that are grouped together.Type: GrantFiled: June 19, 2020Date of Patent: August 22, 2023Assignee: MICRSOFT TECHNOLOGY LICENSING, LLCInventors: Charumathi Lakshmanan, Ye Li, Arnold Overwijk, Chenyan Xiong, Jiguang Shen, Junaid Ahmed, Jiaming Guo
-
Patent number: 11727077Abstract: Described herein are technologies related to inferring information about a webpage based upon semantics of a uniform resource location (URL) of the webpage. The URL is tokenized to create a sequence of tokens. An embedding for the URL is generated based upon the sequence of tokens, wherein the embedding is representative of semantics of the URL. Based upon the embedding for the URL, information about the webpage pointed to by the URL is inferred, the webpage is retrieved, and information is extracted from the webpage based upon the information inferred about the webpage.Type: GrantFiled: February 5, 2021Date of Patent: August 15, 2023Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Siarhei Alonichau, Aliaksei Bondarionok, Junaid Ahmed
-
Patent number: 11657223Abstract: A system for extracting a key phrase from a document includes a neural key phrase extraction model (“BLING-KPE”) having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset.Type: GrantFiled: December 16, 2021Date of Patent: May 23, 2023Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Li Xiong, Chuan Hu, Arnold Overwijk, Junaid Ahmed, Daniel Fernando Campos, Chenyan Xiong
-
Patent number: 11562593Abstract: Technologies pertaining to electronic document understanding are described herein. A document is received, wherein the document includes a section of a type. An image of the document is generated, and a candidate region is identified in the image of the document, wherein the candidate region encompasses the section. A label is assigned to the candidate region based upon text of the section, wherein the label identifies the type of the section. An electronic document understanding task is performed based upon the label assigned to the candidate region.Type: GrantFiled: May 29, 2020Date of Patent: January 24, 2023Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Ziliu Li, Junaid Ahmed, Kwok Fung Tang, Arnold Overwijk, Jue Wang, Charumathi Lakshmanan, Arindam Mitra
-
Patent number: 11455182Abstract: Systems and methods are described for encrypting a swap file in a computer system. The swap file can be encrypted by a background process executing on the computer system. Processing of paging swapping operations occurs independently and separately of the background encryption of the swap file. Processing a page swapping operation can include decrypting or encrypting data to be swapped involved in the paging operation depending on the paging operation and whether or not the data to be swapped is encrypted or not.Type: GrantFiled: May 3, 2019Date of Patent: September 27, 2022Assignee: VMware, Inc.Inventors: Ishan Banerjee, Preeti Agarwal, Valeriy Zhuravlev, Nick M Ryan, Mohammed Junaid Ahmed
-
Publication number: 20220284190Abstract: Described herein are technologies related to tokenizing alphanumeric text through use of a tokenization algorithm that is at least partially implemented as a finite state machine. The tokenization algorithm is configured to output numeric identifiers that represent tokens or sub-tokens in the alphanumeric text.Type: ApplicationFiled: March 2, 2021Publication date: September 8, 2022Inventors: Siarhei ALONICHAU, Junaid AHMED
-
Patent number: 11436034Abstract: Provided are techniques for encrypting a virtual disk of a virtual computing instance (VCI) while the VCI is online and still running using a mirror driver. In certain aspects a mirror driver is a filter running in an I/O stack used for accessing a virtual disk, such that the mirror driver receives I/Os destined to the virtual disk and mirrors those I/Os to the virtual disk and one or more additional virtual disks. The mirror driver begins copying data from an unencrypted source virtual disk to a destination virtual disk, and the data is encrypted as it is stored in the destination virtual disk, while the VCI is still online. During the copying, as new writes are issued to the unencrypted source virtual disk from the VCI, the mirror driver mirrors the writes to both the unencrypted source virtual disk and the destination virtual disk.Type: GrantFiled: November 13, 2019Date of Patent: September 6, 2022Assignee: VMWARE, INC.Inventor: Mohammed Junaid Ahmed
-
Publication number: 20220253502Abstract: Described herein are technologies related to inferring information about a webpage based upon semantics of a uniform resource location (URL) of the webpage. The URL is tokenized to create a sequence of tokens. An embedding for the URL is generated based upon the sequence of tokens, wherein the embedding is representative of semantics of the URL. Based upon the embedding for the URL, information about the webpage pointed to by the URL is inferred, the webpage is retrieved, and information is extracted from the webpage based upon the information inferred about the webpage.Type: ApplicationFiled: February 5, 2021Publication date: August 11, 2022Inventors: Siarhei ALONICHAU, Aliaksei BONDARIONOK, Junaid AHMED
-
Publication number: 20220245161Abstract: Aspects of the present disclosure relate to systems and methods for performing targeted searching based on a user profile. In examples, a user profile including a user embedding may be retrieved based on the receipt of a user indication. The user embedding may be created based on one or more user interest. A plurality of document embeddings may be identified based on the user embedding, where each document embedding of the plurality of document embeddings is determined to be within a first distance of the user embedding. In examples, a ranking for each document embedding of the plurality of document embeddings may be generated, where the ranking for each document embedding of the plurality of document embeddings is based on the user embedding. At least one document may be recommend based on a ranking associated with a document embedding.Type: ApplicationFiled: January 29, 2021Publication date: August 4, 2022Applicant: Microsoft Technology Licensing, LLCInventors: Junaid AHMED, Waleed MALIK, Arnold OVERWIJK
-
Patent number: 11361028Abstract: A technique produces a graph data structure based on at least partially unstructured information dispersed over web documents. The technique involves applying a machine-trained model to a set of documents (or, more generally “document units”) to identify topics in the documents. The technique then generates count information by counting the occurrences of the single topics and co-occurrences of parings of topics in the documents. The technique generates conditional probability information based on the count information. An instance of conditional probability information describes a probability that a first topic will occur, given an appearance of a second topic, and a probability that the second topic will occur, given an appearance of the first topic. The technique then formulates the conditional probability information in a graph data structure. The technique also provides an application system that utilizes the graph data structure to provide any kind of computer-implemented service to a user.Type: GrantFiled: June 9, 2020Date of Patent: June 14, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Ziliu Li, Junaid Ahmed, Arnold Overwijk, Li Xiong, Xiao Liu
-
Publication number: 20220179871Abstract: Document embedding vectors for each document of a corpus may be generated by combining embedding vectors for document subparts, thereby yielding a final embedding vector for the document. A machine learning model is trained using a query corpus and the document corpus, where the model generates a ranking score for a given (query, document) pair. During training, rankings scores are generated using the model, such that the training dataset is further refined using the generated ranking scores. For example, top documents and a negative document may be determined for a given query and subsequently used as training data. Multiple negative documents may therefore be determined for a given query. A negative document for a given query may be determined from the negative documents using noise-contrastive estimation. Such determined negative documents may be evaluated using a loss function during model training, thereby yielding a more robust model for search processing.Type: ApplicationFiled: March 19, 2021Publication date: June 9, 2022Applicant: Microsoft Technology Licensing, LLCInventors: Junaid AHMED, Li XIONG, Arnold OVERWIJK, Chenyan XIONG
-
Publication number: 20220108078Abstract: A system for extracting a key phrase from a document includes a neural key phrase extraction model (“BLING-KPE”) having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset.Type: ApplicationFiled: December 16, 2021Publication date: April 7, 2022Inventors: Li XIONG, Chuan HU, Arnold OVERWIJK, Junaid AHMED, Daniel Fernando CAMPOS, Chenyan XIONG
-
Publication number: 20220076431Abstract: This disclosure relates generally to system and method for forecasting location of target in monocular first person view. Conventional systems for location forecasting utilizes complex neural networks and hence are computationally intensive and requires high compute power. The disclosed system includes an efficient and light-weight RNN based network model for predicting motion of targets in first person monocular videos. The network model includes an auto-encoder in the encoding phase and a regularizing layer in the end helps us get better accuracy. The disclosed method relies entirely just on detection bounding boxes for prediction as well as training of the network model and is still capable of transferring zero-shot on a different dataset.Type: ApplicationFiled: August 18, 2021Publication date: March 10, 2022Applicant: Tata Consultancy Services LimitedInventors: Junaid Ahmed ANSARI, Brojeshwar Bhowmick
-
Patent number: 11263225Abstract: Technologies pertaining to ranking webpages in response to receipt of a query are described. A search engine receives a query and identifies webpages that are germane to the query. The search engine ranks the identified webpages to form a ranked list, wherein a first webpage is positioned in the ranked list based upon a static score assigned to the first webpage. The static score is based upon a weight assigned to a hyperlink in a second webpage, wherein the hyperlink points to the first webpage, and further wherein the weight is based upon a value of a feature of the hyperlink, such as a location of the hyperlink on the second webpage when the second webpage is rendered. Further, the second webpage includes several hyperlinks that point to different webpages, wherein each of the several hyperlinks has a different weight assigned thereto.Type: GrantFiled: May 19, 2020Date of Patent: March 1, 2022Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Ziliu Li, Junaid Ahmed, Arnold Overwijk, Li Xiong
-
Patent number: 11250214Abstract: A system for extracting a key phrase from a document includes a neural key phrase extraction model (“BLING-KPE”) having a first layer to extract a word sequence from the document, a second layer to represent each word in the word sequence by ELMo embedding, position embedding, and visual features, and a third layer to concatenate the ELMo embedding, the position embedding, and the visual features to produce hybrid word embeddings. A convolutional transformer models the hybrid word embeddings to n-gram embeddings, and a feedforward layer converts the n-gram embeddings into a probability distribution over a set of n-grams and calculates a key phrase score of each n-gram. The neural key phrase extraction model is trained on annotated data based on a labeled loss function to compute cross entropy loss of the key phrase score of each n-gram as compared with a label from the annotated dataset.Type: GrantFiled: July 2, 2019Date of Patent: February 15, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Li Xiong, Chuan Hu, Arnold Overwijk, Junaid Ahmed, Daniel Fernando Campos, Chenyan Xiong