Patents by Inventor Ismail Baha Tutar

Ismail Baha Tutar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

VISION-AND-LANGUAGE MODEL TRAINING

Publication number: 20250053616

Abstract: Systems and methods for improving training processes for image and text applications are described. A first set of embeddings may be generated based on a text input, and a second set of embeddings may be generated via a convolutional neural network (CNN), based on an input image. The first set of embeddings and the second set of embeddings may be utilized to generate a third set of embeddings including one or more placeholder values to be replaced. The placeholder values may be replaced based on predicted values, to reconstruct the input text and image.

Type: Application

Filed: October 29, 2024

Publication date: February 13, 2025

Inventors: Tarik Arici, Mehmet Saygin Seyfioglu, Ismail Baha Tutar, Tal Neiman
Multimodal input to text transformer

Patent number: 12190060

Abstract: The present disclosure presents a generative model configured to receive input regarding an item in two different modalities, such as text data and non-text data (including, for example, image or audio data), in order to generate output regarding the item that is determined based on a combination of both modalities' input. Specific relative positional and token type embeddings may be employed in an encoder portion of an encoder-decoder arrangement. An associated decoder may be trained to generate new text corresponding to diverse tasks based on the encoded representation of the two inputs as generated within the encoder. For example, the decoder may be utilized to generate attributes regarding the input item, auto-complete or auto-correct a title or description of the item, among other uses.

Type: Grant

Filed: September 30, 2022

Date of Patent: January 7, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Amirhossein Tavanaei, Karim Bouyarmane, Ismail Baha Tutar
Vision-and-language model training

Patent number: 12141236

Abstract: Systems and methods for improving training processes for image and text applications are described. A first set of embeddings may be generated based on a text input, and a second set of embeddings may be generated via a convolutional neural network (CNN), based on an input image. The first set of embeddings and the second set of embeddings may be utilized to generate a third set of embeddings including one or more placeholder values to be replaced. The placeholder values may be replaced based on predicted values, to reconstruct the input text and image.

Type: Grant

Filed: November 15, 2021

Date of Patent: November 12, 2024

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Tarik Arici, Mehmet Saygin Seyfioglu, Ismail Baha Tutar, Tal Neiman
Similarity detection based on token distinctiveness

Patent number: 12086851

Abstract: Methods, systems, and computer-readable media for similarity detection based on token distinctiveness are disclosed. A similarity detection system determines candidate items for a seed item based on a comparison of tokens in textual descriptions of the candidate items to tokens in a textual description of the seed item. The system uses machine learning to determine importance scores for the tokens of the seed item. An importance score is determined based on the frequency of the individual token and the frequency of the most commonly occurring token in the candidate items. Importance scores for the same token differ from the seed item to another seed item. Based on the importance scores, the system determines similarity scores for the candidate items to the seed item. The system selects, from the candidate items, a set of similar items to the seed item based (at least in part) on the similarity scores.

Type: Grant

Filed: November 14, 2019

Date of Patent: September 10, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Tarik Arici, Ismail Baha Tutar
Artificial intelligence system supporting semi-supervised learning with iterative stacking

Patent number: 11928182

Abstract: A plurality of training iterations is conducted for a machine learning task. A given iteration includes generating a version of a stacking model using a portion of a labeled data set. Proposed labels are then obtained in the iteration using the generated version of the stacking model for a set of unlabeled records. The unlabeled records and their proposed labels are then used to generate versions of base models for the iteration. After the training iterations are completed, a trained ensemble of models including the stacking model and the base models is stored.

Type: Grant

Filed: November 30, 2020

Date of Patent: March 12, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Tarik Arici, Vito Nicola Mandorino, Ismail Baha Tutar
Artificial intelligence system with composite models for multiple response-string queries

Patent number: 11526756

Abstract: Query types for which responses are to be generated with respect to records comprising text attributes are identified, including a text interpretation query type for which records may comprise one or more response-contributor strings. Results of the text interpretation query for a record are based at least partly on an extracted-property class of the records. A machine learning model comprising a first sub-model and a second sub-model is trained to extract results of the text interpretation query. The first sub-model generates an extracted-property class for a record, and the second sub-model predicts positions of response-contributor strings within the record based at least in part on the extracted-property class. A trained version of the model is stored.

Type: Grant

Filed: June 24, 2020

Date of Patent: December 13, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Tarik Arici, Ismail Baha Tutar
Similarity detection system

Patent number: 10824922

Abstract: Similarity detection methods and systems are provided that utilize a convolutional neural network model to jointly learn string matching and semantic textual similarity as an image recognition solution. For example, in some embodiments described herein, the similarity detection system may receive two strings as input, transform the two strings into two separate vectors, generate a high-resolution image and a low-resolution image, apply one or more convolutional operations to each image, and determine string matching and semantic textual similarity based at least partly on the high-resolution image and the low-resolution image.

Type: Grant

Filed: December 12, 2018

Date of Patent: November 3, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Ismail Baha Tutar, Tarik Arici
Techniques for dynamic variations of a search query

Patent number: 10497039

Abstract: Techniques are provided herein for utilizing a query variations engine. An attribute category for a search query may be identified from a search query history. A search query may be received from a user and a set of search results may be determined. A reduced set of search results may be generated from the set of search results based at least in part on the attribute category identified from the search query history. The user may be provided with the reduced set of search results.

Type: Grant

Filed: September 25, 2015

Date of Patent: December 3, 2019

Assignee: Amazon Technologies, Inc.

Inventors: Sachin Birendra Singh, Deept Kumar, Luis Antonio Diniz Fernandes de Morais Sarmento, Ismail Baha Tutar
Automated development and utilization of machine-learning generated classifiers

Patent number: 9818066

Abstract: Technologies are disclosed herein for generating and utilizing machine-learning generated classifiers configured to identify document relationships. Manually-generated data is captured that indicates if documents in a document corpus have a relationship with one another, such as duplicates or variations. A determination may then be made as to whether a classifier is to be generated based on the duplicate decision data. If a classifier is to be generated, machine learning may be performed using training documents from the document corpus and the duplicate decision data to generate a classifier. The machine-learning generated classifier may then be utilized in a production environment to determine whether a new document is a duplicate of documents in the document corpus and/or to identify other relationships between documents in the document corpus, such as documents that are similar or are variations of one another.

Type: Grant

Filed: February 17, 2015

Date of Patent: November 14, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Roshan Ram Rammohan, Jeremy Leon Calvert, Deept Kumar, Ismail Baha Tutar