Patents by Inventor Haggai Roitman

Haggai Roitman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11947604
    Abstract: An example system includes a processor to receive a pseudo-relevance set including top results form a search engine in response to transmitting a set of concatenated messages of a dialog. The processor can execute a first fixed point operation on the pseudo-relevance set to generate weighted terms. The processor can also execute a second fixed point operation on a message graph including nodes with a heaviness based on the weighted terms.
    Type: Grant
    Filed: March 17, 2020
    Date of Patent: April 2, 2024
    Assignee: International Business Machines Corporation
    Inventors: Haggai Roitman, Doron Cohen, Yosi Mass, Shai Erera
  • Patent number: 11915129
    Abstract: A system and a computer-implemented method for ranking tabular data entities by likelihood of comprising answers for (natural language) queries, based on multimodal descriptions of the tabular data entities, comprising separate representations, which represent different aspects of the tabular data entities. The ranking is based on joint representations, generated from the query representation and separate representations of the tabular data entities' aspects, using gated multimodal units. The computer-implemented method may be used for applications such as web searches, data aggregation, and research tasks.
    Type: Grant
    Filed: April 29, 2020
    Date of Patent: February 27, 2024
    Assignee: International Business Machines Corporation
    Inventors: Roee Shraga, Haggai Roitman, Guy Feigenblat, Mustafa Canim
  • Publication number: 20230385887
    Abstract: A system may receive, via a user interface associated with an online marketplace, a request to generate the listing for the item, the request including a natural language text input as a title for the listing. The system may generate, based on inputting the natural language text to a transformer-based machine learning model, a predicted value for an item description attribute of the item. In some examples, a value of the item description attribute may be unspecified in the natural language text and may describe a feature associated with the item as produced. The system may then cause presentation, via the user interface associated with the online marketplace, of the listing including the predicted value for the item description attribute.
    Type: Application
    Filed: May 24, 2022
    Publication date: November 30, 2023
    Inventors: Gilad Eliyhau Fuchs, Haggai Roitman, Matan Mandelbrod
  • Patent number: 11797545
    Abstract: An example system includes a processor to receive concepts extracted from a result set corresponding to a query and result associations for each extracted concept. The processor is to build a graph based on the extracted concepts, wherein the graph comprises a number of nodes representing the extracted concepts and weighted edges representing similarity between concepts extracted from shared results. The processor is to partition the graph into subgraphs with vertices corresponding to candidate facets for vertices having higher sums of weighted edges. The processor is to rank the candidate facets. The processor is to select higher ranked candidate facets to use as facets. The processor is to output facets with a result set in response to the query.
    Type: Grant
    Filed: April 21, 2020
    Date of Patent: October 24, 2023
    Assignee: International Business Machines Corporation
    Inventors: Or Rivlin, Yosi Mass, Haggai Roitman, David Konopnicki
  • Patent number: 11790015
    Abstract: An illustrative embodiment includes a method for post-retrieval query performance prediction using hybrid document-passage information. The method includes: obtaining a set of documents responsive to a specific query; extracting document-level information regarding respective documents within the set; extracting passage-level information regarding respective passages of documents within the set; and estimating a likelihood that the set of documents includes relevant information to the specific query using both the document-level information and the passage-level information.
    Type: Grant
    Filed: September 5, 2022
    Date of Patent: October 17, 2023
    Assignee: International Business Machines Corporation
    Inventor: Haggai Roitman
  • Patent number: 11790885
    Abstract: A method, computer system, and a computer program product for natural language processing are provided. A first text corpus that includes semi-structured content that includes hierarchical nodes may be received. Some of the hierarchical nodes may be masked. Node embeddings and level embeddings may be generated from the semi-structured content of the first text corpus and from the masked hierarchical nodes. The node embeddings and the level embeddings may be included in a bi-directional transformer model. The bi-directional transformer model may be trained on the first text corpus by reducing loss from the bi-directional transformer model predicting the masked hierarchical nodes.
    Type: Grant
    Filed: May 6, 2021
    Date of Patent: October 17, 2023
    Assignee: International Business Machines Corporation
    Inventors: Haggai Roitman, Yosi Mass, Doron Cohen, Jatin Ganhotra
  • Patent number: 11775839
    Abstract: An example system includes a processor to receive a query. The processor can retrieve ranked candidates from an index based on the query. The processor can re-rank the ranked candidates using a Bidirectional Encoder Representations from Transformers (BERT) query-question (Q-q) model trained to match queries to questions of a frequently asked question (FAQ) dataset, wherein the BERT Q-q model is fine-tuned using paraphrases generated for the questions in the FAQ dataset. The processor can return the re-ranked candidates in response to the query.
    Type: Grant
    Filed: June 10, 2020
    Date of Patent: October 3, 2023
    Assignee: International Business Machines Corporation
    Inventors: Yosi Mass, Boaz Carmeli, Haggai Roitman, David Konopnicki
  • Patent number: 11720634
    Abstract: Training a machine learning language model to generate clarification questions for use in conversational search, including: Obtaining multiple dialogs between users and agents, each dialog including messages exchanged between a user and an agent, wherein one of the messages of each dialog includes a reference to a solution document provided by the agent. For each of the dialogs, operating a search engine to retrieve a text passage, relevant to at least one of the messages of the respective dialog, from the respective solution document. Training a machine learning language model to generate a new clarification question given at least one new message and multiple new text passages, wherein the training is based on a training set which comprises, for each of the dialogs: said at least one of the messages of the respective dialog, and the text passage retrieved for the respective dialog.
    Type: Grant
    Filed: March 9, 2021
    Date of Patent: August 8, 2023
    Assignee: International Business Machines Corporation
    Inventors: Yosi Mass, Haggai Roitman, Doron Cohen
  • Patent number: 11687514
    Abstract: Multimodal table encoding, including: Receiving an electronic document that contains a table. The table includes multiple rows, multiple columns, and a schema comprising column labels or row labels. The electronic document includes a description of the table which is located externally to the table. Next, operating separate machine learning encoders to separately encode the description, schema, each of the rows, and each of the columns of the table, respectively. The schema, the rows, and the columns are encoded together with end-of-column tokens and end-of-row tokens that mark an end of each column and row, respectively. Then, applying a machine learning gating mechanism to the encoded description, encoded schema, encoded rows, and encoded columns, to produce a fused encoding of the table, wherein the fused encoding is representative of both a structure of the table and a content of the table.
    Type: Grant
    Filed: July 15, 2020
    Date of Patent: June 27, 2023
    Assignee: International Business Machines Corporation
    Inventors: Roee Shraga, Haggai Roitman, Guy Feigenblat, Mustafa Canim
  • Patent number: 11636082
    Abstract: Ad-hoc table retrieval, including: Representing each of a plurality of tables as a multi-field text document in which: different modalities of the table are represented as separate fields, and a concatenation of all the modalities is represented as a separate, auxiliary field. Receiving a query. Executing the query on the multi-field text documents, to retrieve a list of preliminarily-ranked candidate tables out of the plurality of tables. Calculating an intrinsic table similarity score for each of the candidate tables, based on the query and the auxiliary field. Calculating an extrinsic table similarity score for each of the candidate tables, based on a cluster hypothesis of the candidate tables. Combining: the preliminary rankings, the intrinsic table similarity scores, and the extrinsic table similarity scores, to re-rank the candidate tables.
    Type: Grant
    Filed: June 23, 2020
    Date of Patent: April 25, 2023
    Assignee: International Business Machines Corporation
    Inventors: Haggai Roitman, Guy Feigenblat, Mustafa Canim, Roee Shraga
  • Publication number: 20220414161
    Abstract: An illustrative embodiment includes a method for post-retrieval query performance prediction using hybrid document-passage information. The method includes: obtaining a set of documents responsive to a specific query; extracting document-level information regarding respective documents within the set; extracting passage-level information regarding respective passages of documents within the set; and estimating a likelihood that the set of documents includes relevant information to the specific query using both the document-level information and the passage-level information.
    Type: Application
    Filed: September 5, 2022
    Publication date: December 29, 2022
    Inventor: HAGGAI ROITMAN
  • Publication number: 20220358906
    Abstract: A method, computer system, and a computer program product for natural language processing are provided. A first text corpus that includes semi-structured content that includes hierarchical nodes may be received. Some of the hierarchical nodes may be masked. Node embeddings and level embeddings may be generated from the semi-structured content of the first text corpus and from the masked hierarchical nodes. The node embeddings and the level embeddings may be included in a bi-directional transformer model. The bi-directional transformer model may be trained on the first text corpus by reducing loss from the bi-directional transformer model predicting the masked hierarchical nodes.
    Type: Application
    Filed: May 6, 2021
    Publication date: November 10, 2022
    Inventors: Haggai Roitman, Yosi Mass, Doron Cohen, Jatin Ganhotra
  • Patent number: 11487827
    Abstract: An illustrative embodiment includes a method for post-retrieval query performance prediction using hybrid document-passage information. The method includes: obtaining a set of documents responsive to a specific query; extracting document-level information regarding respective documents within the set; extracting passage-level information regarding respective passages of documents within the set; and estimating a likelihood that the set of documents includes relevant information to the specific query using both the document-level information and the passage-level information.
    Type: Grant
    Filed: December 27, 2018
    Date of Patent: November 1, 2022
    Assignee: International Business Machines Corporation
    Inventor: Haggai Roitman
  • Publication number: 20220300561
    Abstract: An embodiment for predicting a performance of a query in retrieving multifield documents is provided. The embodiment may include receiving a query from a user. The embodiment may also include retrieving a list of multifield documents from a corpus of documents in response to the query. The embodiment may further include generating a pseudo-effective (PE) reference-list for each field in the corpus of documents. The embodiment may also include executing one or more existing query performance prediction (QPP) methods on the retrieved list and each generated PE reference-list. The embodiment may further include deriving one or more extended QPP methods. The embodiment may also include estimating a performance of the query in obtaining the retrieved list of multifield documents based on the one or more extended QPP methods.
    Type: Application
    Filed: March 16, 2021
    Publication date: September 22, 2022
    Inventors: Yosi Mass, HAGGAI ROITMAN, Guy Feigenblat, Roee Shraga
  • Publication number: 20220292139
    Abstract: Training a machine learning language model to generate clarification questions for use in conversational search, including: Obtaining multiple dialogs between users and agents, each dialog including messages exchanged between a user and an agent, wherein one of the messages of each dialog includes a reference to a solution document provided by the agent. For each of the dialogs, operating a search engine to retrieve a text passage, relevant to at least one of the messages of the respective dialog, from the respective solution document. Training a machine learning language model to generate a new clarification question given at least one new message and multiple new text passages, wherein the training is based on a training set which comprises, for each of the dialogs: said at least one of the messages of the respective dialog, and the text passage retrieved for the respective dialog.
    Type: Application
    Filed: March 9, 2021
    Publication date: September 15, 2022
    Inventors: Yosi Mass, Haggai Roitman, Doron Cohen
  • Patent number: 11436288
    Abstract: An embodiment for predicting a performance of a query in retrieving multifield documents is provided. The embodiment may include receiving a query from a user. The embodiment may also include retrieving a list of multifield documents from a corpus of documents in response to the query. The embodiment may further include generating a pseudo-effective (PE) reference-list for each field in the corpus of documents. The embodiment may also include executing one or more existing query performance prediction (QPP) methods on the retrieved list and each generated PE reference-list. The embodiment may further include deriving one or more extended QPP methods. The embodiment may also include estimating a performance of the query in obtaining the retrieved list of multifield documents based on the one or more extended QPP methods.
    Type: Grant
    Filed: March 16, 2021
    Date of Patent: September 6, 2022
    Assignee: International Business Machines Corporation
    Inventors: Yosi Mass, Haggai Roitman, Guy Feigenblat, Roee Shraga
  • Patent number: 11416529
    Abstract: A computer-implemented method, computerized apparatus and computer program product for minimum coordination passage scoring. Given a candidate passage in a document collection potentially matching a query received, a set of overlapping terms between the candidate passage and the query is determined. For each overlapping term in the set, a first measure of a weight of the term in the query, a second measure of a weight of the term in the candidate passage, and a third measure of a specificity of the term in the document collection are calculated. a function of the first and second measure is evaluated to obtain a value reflecting a condition on the relation therebetween. A minimum coordination score representing a relative similarity between the candidate passage and the query is determined based on the value and the first, second and third measures obtained for each of the overlapping terms.
    Type: Grant
    Filed: December 2, 2019
    Date of Patent: August 16, 2022
    Assignee: International Business Machines Corporation
    Inventors: Doron Cohen, Haggai Roitman, Oren Sar-Shalom
  • Patent number: 11341138
    Abstract: A computer-implemented method, computerized apparatus and computer program product for query performance prediction, the method comprising: obtaining a result list comprising a listing of documents retrieved from a collection in response to a query; obtaining for each of the listed documents in the result list a score indicating a measure of the document's relevance to the query; sampling the result list to obtain a plurality of sub-lists each of which comprising a listing of documents subsumed by the result list; for each of the plurality of sub-lists, analyzing scores of the documents listed therein to obtain a sample performance estimator; and estimating performance of the result list based on the sample performance estimator of each of the plurality of sub-lists.
    Type: Grant
    Filed: December 6, 2017
    Date of Patent: May 24, 2022
    Assignee: International Business Machines Corporation
    Inventors: Doron Cohen, Shai Erera, Haggai Roitman, Bar Weiner
  • Patent number: 11327982
    Abstract: In a computerized information retrieval system: executing a search based on a query, to retrieve a set of tables ranked according to their relevancy to the query, wherein each of the tables includes one or more columns; selecting, from the retrieved tables, a predefined number of highest-ranking tables; scoring each column in the highest-ranking tables using a link analysis algorithm, and selecting, from the scored columns, a predefined number of highest-scoring columns; scoring terms contained within each of the highest-scoring columns, and selecting, from the scored terms, a predefined number of highest-scoring terms; re-ranking the highest-ranking tables by using the highest-scoring terms as pseudo relevance feedback that expands the query; and providing, as a response to the query, at least one of: the re-ranked tables, ordered according to the re-ranking, and data contained in at least one of the re-ranked tables, wherein the data are ordered according to the re-ranking.
    Type: Grant
    Filed: October 15, 2020
    Date of Patent: May 10, 2022
    Assignee: International Business Machines Corporation
    Inventors: Haggai Roitman, Guy Feigenblat, Roee Shraga, Bar Weiner
  • Publication number: 20220121669
    Abstract: In a computerized information retrieval system: executing a search based on a query, to retrieve a set of tables ranked according to their relevancy to the query, wherein each of the tables includes one or more columns; selecting, from the retrieved tables, a predefined number of highest-ranking tables; scoring each column in the highest-ranking tables using a link analysis algorithm, and selecting, from the scored columns, a predefined number of highest-scoring columns; scoring terms contained within each of the highest-scoring columns, and selecting, from the scored terms, a predefined number of highest-scoring terms; re-ranking the highest-ranking tables by using the highest-scoring terms as pseudo relevance feedback that expands the query; and providing, as a response to the query, at least one of: the re-ranked tables, ordered according to the re-ranking, and data contained in at least one of the re-ranked tables, wherein the data are ordered according to the re-ranking.
    Type: Application
    Filed: October 15, 2020
    Publication date: April 21, 2022
    Inventors: Haggai ROITMAN, Guy Feigenblat, Roee Shraga, Bar Weiner