Patents by Inventor Victoria Mazel

Victoria Mazel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Processing speech to text queries by optimizing conversion of speech queries to text

Patent number: 10339924

Abstract: Techniques for processing a speech to text query are described herein. The techniques may include receiving a plurality of speech to text translation alternatives for a phrase of a natural language query, and tagging and parsing each of the translation alternatives based on a static analysis of the known domain that is at least partially structured, known tags of the known domain, and custom rules. The techniques may also include ranking the translation alternatives based on the tagging and parsing and translating the phrase based on the ranking.

Type: Grant

Filed: May 2, 2016

Date of Patent: July 2, 2019

Assignee: International Business Machines Corporation

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman
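
The ranking step described in the abstract above can be illustrated with a minimal sketch. It assumes a hypothetical `known_tags` dictionary standing in for the output of the static domain analysis, and it scores each speech-to-text alternative simply by how many of its words carry a known tag. The patent's actual tagging, parsing, and custom rules are not specified here, so this is only an illustration of the idea.

```python
def rank_alternatives(alternatives, known_tags):
    """Order speech-to-text alternatives by how many words match known domain tags."""
    def score(text):
        # Count the words that the (hypothetical) domain analysis recognizes.
        return sum(1 for word in text.lower().split() if word in known_tags)

    # sorted() is stable, so alternatives with equal scores keep their input order.
    return sorted(alternatives, key=score, reverse=True)


# Hypothetical tags for a mail repository; not taken from the patent.
known_tags = {
    "show": "command",
    "mail": "object_type",
    "from": "relation",
    "bob": "person",
}
alternatives = ["show male from bob", "show mail from bob", "so mail from bob"]
ranked = rank_alternatives(alternatives, known_tags)
print(ranked[0])  # show mail from bob
```

The top-ranked alternative would then be used as the text of the query.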

Processing speech to text queries by optimizing conversion of speech queries to text

Patent number: 10332511

Abstract: Techniques for processing a speech to text query are described herein. The techniques may include receiving a plurality of speech to text translation alternatives for a phrase of a natural language query, and tagging and parsing each of the translation alternatives based on a static analysis of the known domain that is at least partially structured, known tags of the known domain, and custom rules. The techniques may also include ranking the translation alternatives based on the tagging and parsing and translating the phrase based on the ranking.

Type: Grant

Filed: July 24, 2015

Date of Patent: June 25, 2019

Assignee: International Business Machines Corporation

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

Automatic analysis of repository structure to facilitate natural language queries

Patent number: 10242008

Abstract: Techniques for analyzing a repository are described herein. A method for analyzing a repository may include obtaining a list of known persons in a repository based on objects, users, and groups retrieved from the repository. The method may further select one of the objects having a field and a value, and then determine whether the field of the selected object is a facet based on a probability that the field of the selected object has a limited number of possible values. In analyzing the repository, a repository information archive may be generated. The repository information archive may include the relationship between the selected object and at least one other object, statistics and counts related to properties in the selected objects, and whether or not the field of the selected object is a facet.

Type: Grant

Filed: July 6, 2015

Date of Patent: March 26, 2019

Assignee: International Business Machines Corporation

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman
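
The facet test described in the abstract above can be sketched as follows. The sketch assumes that "a limited number of possible values" is approximated by the ratio of distinct values to total occurrences of a field. The function name and the `max_distinct_ratio` and `min_count` parameters are hypothetical and are not taken from the patent.

```python
from collections import defaultdict


def find_facets(objects, max_distinct_ratio=0.5, min_count=2):
    """Return the names of fields whose values repeat often enough to act as facets."""
    values_by_field = defaultdict(list)
    for obj in objects:
        for field, value in obj.items():
            values_by_field[field].append(value)

    facets = set()
    for field, values in values_by_field.items():
        if len(values) < min_count:
            continue  # too few observations to judge
        # A low distinct/total ratio suggests a small set of possible values.
        if len(set(values)) / len(values) <= max_distinct_ratio:
            facets.add(field)
    return facets


docs = [
    {"id": 1, "type": "invoice", "author": "ann"},
    {"id": 2, "type": "invoice", "author": "bob"},
    {"id": 3, "type": "memo", "author": "ann"},
    {"id": 4, "type": "memo", "author": "cat"},
]
print(find_facets(docs))  # {'type'}
```

Here `id` is unique per object and `author` has three distinct values out of four, so only `type` falls under the assumed threshold.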

Automatic analysis of repository structure to facilitate natural language queries

Patent number: 10242009

Abstract: Techniques for analyzing a repository are described herein. A method for analyzing a repository may include obtaining a list of known persons in a repository based on objects, users, and groups retrieved from the repository. The method may further select one of the objects having a field and a value, and then determine whether the field of the selected object is a facet based on a probability that the field of the selected object has a limited number of possible values. In analyzing the repository, a repository information archive may be generated. The repository information archive may include the relationship between the selected object and at least one other object, statistics and counts related to properties in the selected objects, and whether or not the field of the selected object is a facet.

Type: Grant

Filed: April 27, 2016

Date of Patent: March 26, 2019

Assignee: International Business Machines Corporation

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

Generating and executing query language statements from natural language

Patent number: 10180989

Abstract: Techniques for generating query language statements for a document repository are described herein. An example method includes detecting a search query corresponding to a document repository and generating a modified search query by adding atomic tags to the search query, the atomic tags being based on prior knowledge obtained by static analysis of the document repository and semantic rules. The method also includes generating enriched tags based on combinations of the atomic tags and any previously identified enriched tags and generating a first set of conditions based on combinations of the atomic tags and the generated enriched tags and generating a second set of conditions based on free-text conditions. The method also includes generating the query language statements based on the first set of conditions and the second set of conditions and displaying a plurality of documents from the document repository that satisfy the query language statements.

Type: Grant

Filed: July 24, 2015

Date of Patent: January 15, 2019

Assignee: International Business Machines Corporation

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

Generating and executing query language statements from natural language

Patent number: 10169471

Abstract: Techniques for generating query language statements for a document repository are described herein. An example method includes detecting a search query corresponding to a document repository and generating a modified search query by adding atomic tags to the search query, the atomic tags being based on prior knowledge obtained by static analysis of the document repository and semantic rules. The method also includes generating enriched tags based on combinations of the atomic tags and any previously identified enriched tags and generating a first set of conditions based on combinations of the atomic tags and the generated enriched tags and generating a second set of conditions based on free-text conditions. The method also includes generating the query language statements based on the first set of conditions and the second set of conditions and displaying a plurality of documents from the document repository that satisfy the query language statements.

Type: Grant

Filed: April 28, 2016

Date of Patent: January 1, 2019

Assignee: International Business Machines Corporation

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

PROCESSING SPEECH TO TEXT QUERIES BY OPTIMIZING CONVERSION OF SPEECH QUERIES TO TEXT

Publication number: 20170025120

Abstract: Techniques for processing a speech to text query are described herein. The techniques may include receiving a plurality of speech to text translation alternatives for a phrase of a natural language query, and tagging and parsing each of the translation alternatives based on a static analysis of the known domain that is at least partially structured, known tags of the known domain, and custom rules. The techniques may also include ranking the translation alternatives based on the tagging and parsing and translating the phrase based on the ranking.

Type: Application

Filed: July 24, 2015

Publication date: January 26, 2017

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

PROCESSING SPEECH TO TEXT QUERIES BY OPTIMIZING CONVERSION OF SPEECH QUERIES TO TEXT

Publication number: 20170024459

Abstract: Techniques for processing a speech to text query are described herein. The techniques may include receiving a plurality of speech to text translation alternatives for a phrase of a natural language query, and tagging and parsing each of the translation alternatives based on a static analysis of the known domain that is at least partially structured, known tags of the known domain, and custom rules. The techniques may also include ranking the translation alternatives based on the tagging and parsing and translating the phrase based on the ranking.

Type: Application

Filed: May 2, 2016

Publication date: January 26, 2017

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

GENERATING AND EXECUTING QUERY LANGUAGE STATEMENTS FROM NATURAL LANGUAGE

Publication number: 20170024431

Abstract: Techniques for generating query language statements for a document repository are described herein. An example method includes detecting a search query corresponding to a document repository and generating a modified search query by adding atomic tags to the search query, the atomic tags being based on prior knowledge obtained by static analysis of the document repository and semantic rules. The method also includes generating enriched tags based on combinations of the atomic tags and any previously identified enriched tags and generating a first set of conditions based on combinations of the atomic tags and the generated enriched tags and generating a second set of conditions based on free-text conditions. The method also includes generating the query language statements based on the first set of conditions and the second set of conditions and displaying a plurality of documents from the document repository that satisfy the query language statements.

Type: Application

Filed: April 28, 2016

Publication date: January 26, 2017

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

GENERATING AND EXECUTING QUERY LANGUAGE STATEMENTS FROM NATURAL LANGUAGE

Publication number: 20170024443

Abstract: Techniques for generating query language statements for a document repository are described herein. An example method includes detecting a search query corresponding to a document repository and generating a modified search query by adding atomic tags to the search query, the atomic tags being based on prior knowledge obtained by static analysis of the document repository and semantic rules. The method also includes generating enriched tags based on combinations of the atomic tags and any previously identified enriched tags and generating a first set of conditions based on combinations of the atomic tags and the generated enriched tags and generating a second set of conditions based on free-text conditions. The method also includes generating the query language statements based on the first set of conditions and the second set of conditions and displaying a plurality of documents from the document repository that satisfy the query language statements.

Type: Application

Filed: July 24, 2015

Publication date: January 26, 2017

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

AUTOMATIC ANALYSIS OF REPOSITORY STRUCTURE TO FACILITATE NATURAL LANGUAGE QUERIES

Publication number: 20170011050

Abstract: Techniques for analyzing a repository are described herein. A method for analyzing a repository may include obtaining a list of known persons in a repository based on objects, users, and groups retrieved from the repository. The method may further select one of the objects having a field and a value, and then determine whether the field of the selected object is a facet based on a probability that the field of the selected object has a limited number of possible values. In analyzing the repository, a repository information archive may be generated. The repository information archive may include the relationship between the selected object and at least one other object, statistics and counts related to properties in the selected objects, and whether or not the field of the selected object is a facet.

Type: Application

Filed: April 27, 2016

Publication date: January 12, 2017

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

AUTOMATIC ANALYSIS OF REPOSITORY STRUCTURE TO FACILITATE NATURAL LANGUAGE QUERIES

Publication number: 20170011047

Abstract: Techniques for analyzing a repository are described herein. A method for analyzing a repository may include obtaining a list of known persons in a repository based on objects, users, and groups retrieved from the repository. The method may further select one of the objects having a field and a value, and then determine whether the field of the selected object is a facet based on a probability that the field of the selected object has a limited number of possible values. In analyzing the repository, a repository information archive may be generated. The repository information archive may include the relationship between the selected object and at least one other object, statistics and counts related to properties in the selected objects, and whether or not the field of the selected object is a facet.

Type: Application

Filed: July 6, 2015

Publication date: January 12, 2017

Inventors: Yigal S. Dayan, Josemina M. Magdalen, Irit Maharian, Victoria Mazel, Oren Paikowsky, Andrei Shtilman

Efficient implementation of morphology for agglutinative languages

Patent number: 9218336

Abstract: Constructing an automaton for automated analysis of agglutinative languages comprises: constructing an affix automaton for each of a plurality of affix types of an agglutinative language, where each of the affix types is associated with one or more affixes associated with a morphological concept; combining any of the affix automatons to form a plurality of template automatons, where each of the template automatons is patterned after any of a plurality of agglutination templates of any of the affix types for the language; and combining the template automatons into a master automaton.

Type: Grant

Filed: March 28, 2007

Date of Patent: December 22, 2015

Assignee: International Business Machines Corporation

Inventors: Daniel Cohen, Yigal Shai Dayan, Josemina Marcella Magdalen, Victoria Mazel

Efficient stemming of semitic languages

Patent number: 8438010

Abstract: A system for stemming words of Semitic languages, the system including an affix scanner configured to scan a word of a Semitic language for at least one affix according to a predefined scanning sequence and determine if at least one predefined scanning criterion is met, and a stemmer configured to remove the affix from the word if the predefined scanning criterion is met.

Type: Grant

Filed: December 6, 2007

Date of Patent: May 7, 2013

Assignee: International Business Machines Corporation

Inventors: Daniel Cohen, Yigal Shai Dayan, Josemina Magdalen, Victoria Mazel
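
The scan-then-strip behaviour described in the abstract above can be sketched as follows. The affix lists are toy transliterated strings chosen for illustration, and the scanning criterion (the remaining stem must keep at least `MIN_STEM` characters) is an assumption; the patent's actual scanning sequence and criteria are not given in this listing.

```python
# Hypothetical affix inventories, scanned in list order (the "scanning sequence").
PREFIXES = ["wa", "al", "bi"]
SUFFIXES = ["at", "un", "ha"]
MIN_STEM = 3  # assumed scanning criterion: never shrink the stem below this length


def stem(word):
    """Strip known prefixes, then known suffixes, when the criterion is met."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= MIN_STEM:
            word = word[len(prefix):]
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM:
            word = word[:-len(suffix)]
    return word


print(stem("waalkitabha"))  # kitab
print(stem("alat"))         # alat (stripping would leave fewer than 3 characters)
```

The second call shows the criterion blocking removal: both "al" and "at" match, but neither is stripped because the remainder would be too short.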

Learning word segmentation from non-white space languages corpora

Patent number: 8165869

Abstract: Illustrative embodiments provide a computer implemented method, apparatus, and computer program product for learning word segmentation from non-white space language corpora. In one illustrative embodiment, the computer implemented method receives text input characters and calculates a ratio-measure for each pair of characters in the input characters. The computer implemented method further determines whether the ratio-measure of each pair of characters is equal to a predetermined threshold value. Responsive to determining the ratio-measure is less than the predetermined threshold value, and a local-minimum value, the computer method further identifies the pair as a weak pair and breaks the weak pair of characters.

Type: Grant

Filed: December 10, 2007

Date of Patent: April 24, 2012

Assignee: International Business Machines Corporation

Inventors: Josemina Marcella Magdalen, Yigal Shai Dayan, Victoria Mazel, Daniel Cohen
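
The weak-pair rule described in the abstract above can be sketched as follows. The abstract does not define the ratio-measure, so this sketch takes precomputed pair scores as input (the `ratio` dictionary, with made-up values) and only illustrates the break decision: a pair is split when its score is below the threshold and is a local minimum among its neighbours.

```python
def segment(text, ratio, threshold):
    """Split text at adjacent character pairs whose score is a below-threshold local minimum."""
    # scores[i] is the ratio-measure of the pair (text[i], text[i + 1]).
    scores = [ratio.get(text[i:i + 2], 0.0) for i in range(len(text) - 1)]

    pieces, start = [], 0
    for i, score in enumerate(scores):
        left = scores[i - 1] if i > 0 else float("inf")
        right = scores[i + 1] if i + 1 < len(scores) else float("inf")
        # Ties with a neighbour still count as a local minimum in this sketch.
        if score < threshold and score <= left and score <= right:
            pieces.append(text[start:i + 1])  # break between text[i] and text[i + 1]
            start = i + 1
    pieces.append(text[start:])
    return pieces


# Hypothetical pair scores; in practice they would be learned from a corpus.
ratio = {"ab": 0.9, "bc": 0.1, "cd": 0.8}
print(segment("abcd", ratio, threshold=0.5))  # ['ab', 'cd']
```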

Hybrid text segmentation using N-grams and lexical information

Patent number: 7917353

Abstract: A hybrid n-gram/lexical analysis tokenization system including a lexicon and a hybrid tokenizer operative to perform both N-gram tokenization of a text and lexical analysis tokenization of a text using the lexicon, and to construct either of an index and a classifier from the results of both of the N-gram tokenization and the lexical analysis tokenization, where the hybrid tokenizer is implemented in at least one of computer hardware and computer software and is embodied within a computer-readable medium.

Type: Grant

Filed: March 29, 2007

Date of Patent: March 29, 2011

Assignee: International Business Machines Corporation

Inventors: Yigal Shai Dayan, Josemina Marcella Magdalen, Victoria Mazel
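
The hybrid tokenization described in the abstract above can be sketched as follows. The sketch assumes character bigrams for the N-gram side and a greedy longest-match lookup for the lexical side, then merges both token streams into one inverted index. The function names and the matching strategy are illustrative assumptions, not details from the patent.

```python
from collections import defaultdict


def ngrams(text, n=2):
    """Return all overlapping character n-grams of text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def lexical_tokens(text, lexicon):
    """Greedy longest-match tokenization; characters with no lexicon match are skipped."""
    tokens, i = [], 0
    max_len = max(map(len, lexicon), default=0)
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            if text[i:i + size] in lexicon:
                tokens.append(text[i:i + size])
                i += size
                break
        else:
            i += 1  # no lexicon entry starts here
    return tokens


def build_index(docs, lexicon, n=2):
    """Build an inverted index from the union of n-gram and lexical tokens."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in ngrams(text, n) + lexical_tokens(text, lexicon):
            index[token].add(doc_id)
    return index


index = build_index({1: "abcd", 2: "bcx"}, lexicon={"abc"})
print(sorted(index["bc"]))   # [1, 2]
print(sorted(index["abc"]))  # [1]
```

Indexing both token types means a query can still match through bigrams when a word is missing from the lexicon.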

Unsupervised stemming schema learning and lexicon acquisition from corpora

Patent number: 7912703

Abstract: Illustrated embodiments provide a computer implemented method, an apparatus, and a computer program product for unsupervised stemming schema learning and lexicon acquisition from corpora. In one illustrative embodiment, the computer implemented method obtains a corpus from corpora, analyzes the corpus to deduce a set of possible stemming schema and reviews and revises the set of possible stemming schema, to create a pruned set of stemming schema. The computer implemented method further deduces a lexicon from the corpus using the pruned set of stemming schema.

Type: Grant

Filed: December 10, 2007

Date of Patent: March 22, 2011

Assignee: International Business Machines Corporation

Inventors: Josemina Marcella Magdalen, Yigal Shai Dayan, Victoria Mazel, Daniel Cohen

EFFICIENT STEMMING OF SEMITIC LANGUAGES

Publication number: 20090150140

Abstract: A system for stemming words of Semitic languages, the system including an affix scanner configured to scan a word of a Semitic language for at least one affix according to a predefined scanning sequence and determine if at least one predefined scanning criterion is met, and a stemmer configured to remove the affix from the word if the predefined scanning criterion is met.

Type: Application

Filed: December 6, 2007

Publication date: June 11, 2009

Applicant: International Business Machines Corporation

Inventors: Daniel Cohen, Yigal Shai Dayan, Josemina Magdalen, Victoria Mazel

Learning word segmentation from non-white space languages corpora

Publication number: 20090150145

Abstract: Illustrative embodiments provide a computer implemented method, apparatus, and computer program product for learning word segmentation from non-white space language corpora. In one illustrative embodiment, the computer implemented method receives text input characters and calculates a ratio-measure for each pair of characters in the input characters. The computer implemented method further determines whether the ratio-measure of each pair of characters is equal to a predetermined threshold value. Responsive to determining the ratio-measure is less than the predetermined threshold value, and a local-minimum value, the computer method further identifies the pair as a weak pair and breaks the weak pair of characters.

Type: Application

Filed: December 10, 2007

Publication date: June 11, 2009

Inventors: Josemina Marcella Magdalen, Yigal Shai Dayan, Victoria Mazel, Daniel Cohen

UNSUPERVISED STEMMING SCHEMA LEARNING AND LEXICON ACQUISITION FROM CORPORA

Publication number: 20090150415

Abstract: Illustrated embodiments provide a computer implemented method, an apparatus, and a computer program product for unsupervised stemming schema learning and lexicon acquisition from corpora. In one illustrative embodiment, the computer implemented method obtains a corpus from corpora, analyzes the corpus to deduce a set of possible stemming schema and reviews and revises the set of possible stemming schema, to create a pruned set of stemming schema. The computer implemented method further deduces a lexicon from the corpus using the pruned set of stemming schema.

Type: Application

Filed: December 10, 2007

Publication date: June 11, 2009

Inventors: Josemina Marcella Magdalen, Yigal Shai Dayan, Victoria Mazel, Daniel Cohen
