Patents by Inventor Tong-Haing Fin
Tong-Haing Fin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8356278Abstract: Under the present invention, a best practice violation (BPV) engine resource is first built. This typically occurs by: receiving sets of source code from a plurality of sources; detecting a programming language of each of the sets of source code; extracting code patterns from the sets of source code; defining meta data for each code pattern that indicates a quality thereof, classifying each code pattern; and assigning a rank to each of the code pattern based on it's corresponding meta data as well as a skill level and an experience level of the developer thereof. Once the BPV engine resource is built, it will be used to detect best practice violations in other sets of source code.Type: GrantFiled: September 10, 2009Date of Patent: January 15, 2013Assignee: International Business Machines CorporationInventors: Youssef Drissi, Tong-Haing Fin, Moon Ju Kim, Lev Koaakov, Juan Leon Rodriquez
-
Patent number: 7757163Abstract: An unknown annotator and its annotation type system are compared against a reference annotation type system. The comparison is done by providing a plurality of documents, and annotating each document using the reference set of document annotators, producing instances of reference annotation types, to generate a pre-annotated reference document set, and using the subject annotator and its subject annotation type system to generate a pre-annotated evaluation document set. Documents in the pre-annotated evaluation document set are compared to documents in the pre-annotated reference document set, and matches in location, within the compared documents, of instances of the subject annotation types and the reference annotation types are identified. Based on the matching data, reference document annotation types are selected that sufficiently correlate with the subject annotation type system.Type: GrantFiled: January 5, 2007Date of Patent: July 13, 2010Assignee: International Business Machines CorporationInventors: Yurdaer N. Doganata, Youssef Drissi, David A. Ferrucci, Tong-Haing Fin, Genady Grabarnik, Lev Kozakov
-
Patent number: 7676452Abstract: Disclosed is a method and apparatus for operating a search system for searching a search space containing domains each representing a collection of related documents. The method includes establishing a domain characterization model (DCM) that includes a DCM index containing keywords and, for each keyword, an identification of a domain where the keyword is found. In response to an original search query, the method forms one or a plurality of enhanced search queries in accordance with the DCM for searching a plurality of domains. Each enhanced search query is formed for searching a particular targeted domain and contains the original search query and at least one keyword associated with the particular domain. The search results are clustered and are displayed so as to be organized by the domains that were searched. The top search results returned in response to each enhanced search query belong to one of the targeted domains.Type: GrantFiled: July 23, 2002Date of Patent: March 9, 2010Assignee: International Business Machines CorporationInventors: Yurdaer N. Doganata, Youssef Drissi, Tong-Haing Fin, Genady Grabarnik, Moon J. Kim, Lev Kozakov, Sheng Ma, Juan Leon Rodriguez
-
Patent number: 7647579Abstract: Under the present invention, a best practice violation (BPV) engine resource is first built. This typically occurs by: receiving sets of source code from a plurality of sources; detecting a programming language of each of the sets of source code; extracting code patterns from the sets of source code; defining meta data for each code pattern that indicates a quality; classifying each code pattern; and assigning a rank to each of the code pattern based on its corresponding meta data as well as a skill level and an experience level of the developer. Once the BPV engine resource is built, it will be used to detect best practice violations in other sets of source code.Type: GrantFiled: March 31, 2004Date of Patent: January 12, 2010Assignee: International Business Machines CorporationInventors: Youssef Drissi, Tong-Haing Fin, Moon Ju Kim, Lev Kozakov, Juan Leon Rodriguez
-
Publication number: 20100005446Abstract: Under the present invention, a best practice violation (BPV) engine resource is first built. This typically occurs by: receiving sets of source code from a plurality of sources; detecting a programming language of each of the sets of source code; extracting code patterns from the sets of source code; defining meta data for each code pattern that indicates a quality thereof, classifying each code pattern; and assigning a rank to each of the code pattern based on it's corresponding meta data as well as a skill level and an experience level of the developer thereof. Once the BPV engine resource is built, it will be used to detect best practice violations in other sets of source code.Type: ApplicationFiled: September 10, 2009Publication date: January 7, 2010Inventors: Youssef Drissi, Tong-Haing Fin, Moon Ju Kim, Lev Koaakov, Juan Leon Rodriquez
-
Patent number: 7584453Abstract: Under the present invention source code can be shared among nodes in a peer-to-peer network. Specifically, source code to be shared will first be analyzed to identify a set of code patterns, assigned one or more predetermined categories based on the set of code patterns, and then selectively indexed. A developer desiring to use previously created source code when creating a new program can perform a context dependent search based on his/her working code to identify and retrieve relevant source code.Type: GrantFiled: August 30, 2007Date of Patent: September 1, 2009Assignee: International Business Machines CorporationInventors: Roberto J. Bayardo, Jr., Yurdaer N. Doganata, Youssef Drissi, Tong-Haing Fin, Moon J. Kim, Lev Kozakov, Juan L. Rodriguez
-
Patent number: 7577939Abstract: Under the present invention source code can be shared among nodes in a peer-to-peer network. Specifically, source code to be shared will first be analyzed to identify a set of code patterns, assigned one or more predetermined categories based on the set of code patterns, and then selectively indexed. A developer desiring to use previously created source code when creating a new program can perform a context dependent search based on his/her working code to identify and retrieve relevant source code.Type: GrantFiled: June 27, 2003Date of Patent: August 18, 2009Assignee: International Business Machines CorporationInventors: Roberto J. Bayardo, Jr., Yurdaer N. Doganata, Youssef Drissi, Tong-Haing Fin, Moon J. Kim, Lev Kozakov, Juan L. Rodriguez
-
Publication number: 20080168080Abstract: An unknown annotator and its annotation type system are compared against a reference annotation type system. The comparison is done by providing a plurality of documents, and annotating each document using the reference set of document annotators, producing instances of reference annotation types, to generate a pre-annotated reference document set, and using the subject annotator and its subject annotation type system to generate a pre-annotated evaluation document set. Documents in the pre-annotated evaluation document set are compared to documents in the pre-annotated reference document set, and matches in location, within the compared documents, of instances of the subject annotation types and the reference annotation types are identified. Based on the matching data, reference document annotation types are selected that sufficiently correlate with the subject annotation type system.Type: ApplicationFiled: January 5, 2007Publication date: July 10, 2008Inventors: Yurdaer N. Doganata, Youssef Drissi, David A. Ferrucci, Tong-Haing Fin, Genady Grabarnik, Lev Kozakov
-
Publication number: 20080016039Abstract: A document search and retrieval system and program product therefor. Search requests are provided to the system through a user interface. A document decomposer decomposes documents into individual document components. Document components and corresponding searchable indices for each are stored in a Component Library. A search unit searches stored document components responsive to search queries. A results validator compares document hitlists with a document type identified in a search query to select valid hitlists entries for a final hitlist. A document view assembly module collects identified document components and assembles them into a document for view at the user interface.Type: ApplicationFiled: July 10, 2007Publication date: January 17, 2008Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: GREGORY BROWN, THOMAS COFINO, YURDAER DOGANATA, YOUSSEF DRISSI, TONG-HAING FIN, MOON KIM, LEV KOZAKOV, JOHN MILLER
-
Publication number: 20070294264Abstract: Under the present invention source code can be shared among nodes in a peer-to-peer network. Specifically, source code to be shared will first be analyzed to identify a set of code patterns, assigned one or more predetermined categories based on the set of code patterns, and then selectively indexed. A developer desiring to use previously created source code when creating a new program can perform a context dependent search based on his/her working code to identify and retrieve relevant source code.Type: ApplicationFiled: August 30, 2007Publication date: December 20, 2007Inventors: Roberto Bayardo, Yurdaer Doganata, Youssef Drissi, Tong-Haing Fin, Moon Kim, Lev Kozakov, Juan Rodriguez
-
Patent number: 7260570Abstract: Search time is reduced with a search engine that includes a bi-directional inverted index facility which can be accessed with a keyword search in one of a number of languages and provide a listing of documents contained in all of those languages. The keywords in all supported languages are preferably stored in an inverted index lookup table cross referenced to documents in those language containing the keywords. Keywords with the same meaning in different languages are accessible together when that keyword in one of the languages is queried. The search engine containing the table can identify pertinent documents either in a selected language, a second language or in all supported languages, as determined by the user. Information about each document can include not only the identity of the document but also information used in ranking the documents such as the number of times that a keyword appears in that document, and the keywords proximity to other keywords.Type: GrantFiled: June 26, 2002Date of Patent: August 21, 2007Assignee: International Business Machines CorporationInventors: Gregory T. Brown, Yurdaer Nezihi Doganata, Youssef Drissi, Tong-Haing Fin, Noon Ju Kim, Lev Kozakov, Juan Leon-Rodriquez
-
Patent number: 7254571Abstract: A document search and retrieval system and program product therefor. Search requests are provided to the system through a user interface. A document decomposer decomposes documents into individual document components. Document components and corresponding searchable indices for each are stored in a Component Library. A search unit searches stored document components responsive to search queries. A results validator compares document hitlists with a document type identified in a search query to select valid hitlists entries for a final hitlist. A document view assembly module collects identified document components and assembles them into a document for view at the user interface.Type: GrantFiled: June 3, 2002Date of Patent: August 7, 2007Assignee: International Business Machines CorporationInventors: Gregory T. Brown, Thomas A. Cofino, Yurdaer Nezihi Doganata, Youssef Drissi, Tong-Haing Fin, Moon Ju Kim, Lev Kozakov, John Williams Miller
-
Patent number: 7092938Abstract: A search engine system is provided in which inputted data in various configurations is first passed through transitional translation layers which convert the data in its multiple configurations to a common or kernel computer form and language. The data is operated on and stored in the kernel form and language. The operated on data is then outputted through transitional translation layers and provided in the desired format the same or different from the input form and language.Type: GrantFiled: August 28, 2002Date of Patent: August 15, 2006Assignee: International Business Machines CorporationInventors: Gregory T. Brown, Thomas A. Cofino, Yurdaer Nezihi Doganata, Youssef Drissi, Tong-Haing Fin, Moon Ju Kim, Lev Kozakov
-
Patent number: 7089250Abstract: A method, system, and computer program product for associating events. A provided event dataset includes events occurring in each of N successive time intervals (N?3). Association rules pertaining to successive events in the event dataset are deduced. Sequences of events are generated from the association rules. Clusters of the sequences of events are formed. Sequences of the clusters are created. The clusters of the sequences may be used: to identify at least one event occurring in a time interval of the N time intervals as being a probable cause of at least one event occurring in a later-occurring time interval of the N time intervals; or to predict an occurrence of at least one event in a time interval occurring after the N time intervals, wherein the at least one event had occurred within the N time intervals.Type: GrantFiled: October 8, 2003Date of Patent: August 8, 2006Assignee: International Business Machines CorporationInventors: Yurdaer N. Doganata, Youssef Drissi, Tong-Haing Fin, Jun-Jang Jeng, Moon J. Kim, Lev Kozakov
-
Publication number: 20050223354Abstract: Under the present invention, a best practice violation (BPV) engine resource is first built. This typically occurs by: receiving sets of source code from a plurality of sources; detecting a programming language of each of the sets of source code; extracting code patterns from the sets of source code; defining meta data for each code pattern that indicates a quality thereof; classifying each code pattern; and assigning a rank to each of the code pattern based on it's corresponding meta data as well as a skill level and an experience level of the developer thereof. Once the BPV engine resource is built, it will be used to detect best practice violations in other sets of source code.Type: ApplicationFiled: March 31, 2004Publication date: October 6, 2005Applicant: International Business Machines CorporationInventors: Youssef Drissi, Tong-Haing Fin, Moon Kim, Lev Kozakov, Juan Rodriguez
-
Publication number: 20050080806Abstract: A method, system, and computer program product for associating events. A provided event dataset includes events occurring in each of N successive time intervals (N?3). Association rules pertaining to successive events in the event dataset are deduced. Sequences of events are generated from the association rules. Clusters of the sequences of events are formed. Sequences of the clusters are created. The clusters of the sequences may be used: to identify at least one event occurring in a time interval of the N time intervals as being a probable cause of at least one event occurring in a later-occurring time interval of the N time intervals; or to predict an occurrence of at least one event in a time interval occurring after the N time intervals, wherein the at least one event had occurred within the N time intervals.Type: ApplicationFiled: October 8, 2003Publication date: April 14, 2005Inventors: Yurdaer Doganata, Youssef Drissi, Tong-Haing Fin, Jun-Jang Jeng, Moon Kim, Lev Kozakov
-
Publication number: 20050065774Abstract: An automatic search index/meta data self-enhancement system includes a search system log analyzer, which periodically looks through the search system log, of a database, for search queries that did not bring satisfactory results; a search query analyzer which applies query enhancement techniques to the unsatisfactory queries by using glossary terms, synonyms, known typos, translated words, etc. to enhance the queries and categorize them; a relevant document finder which, based on the enhanced query terms and their categorization and subject, uncovers documents that were not previously found and links the documents to the query terms in the search index; and a search index/meta data enhancer, that enhances the meta/data of the documents based on the enhanced query terms in the search index, to reflect these new keywords to allow documents turned up by the enhanced query to be returned when similar future searches are entered by users.Type: ApplicationFiled: September 20, 2003Publication date: March 24, 2005Applicant: International Business Machines CorporationInventors: Yurdaer Doganata, Youssef Drissi, Tong-Haing Fin, Kozakov Lev, Moon Kim, Juan Rodriguez
-
Publication number: 20040267756Abstract: Under the present invention source code can be shared among nodes in a peer-to-peer network. Specifically, source code to be shared will first be analyzed to identify a set of code patterns, assigned one or more predetermined categories based on the set of code patterns, and then selectively indexed. A developer desiring to use previously created source code when creating a new program can perform a context dependent search based on his/her working code to identify and retrieve relevant source code.Type: ApplicationFiled: June 27, 2003Publication date: December 30, 2004Applicant: International Business Machines CorporationInventors: Roberto J. Bayardo, Yurdaer N. Doganata, Youssef Drissi, Tong-Haing Fin, Moon J. Kim, Lev Kozakov, Juan L. Rodriguez
-
Publication number: 20040044669Abstract: A search engine system is provided in which inputted data in various configurations is first passed through transitional translation layers which convert the data in its multiple configurations to a common or kernel computer form and language. The data is operated on and stored in the kernel form and language. The operated on data is then outputted through transitional translation layers and provided in the desired format the same or different from the input form and language.Type: ApplicationFiled: August 28, 2002Publication date: March 4, 2004Applicant: International Business Machines CorporationInventors: Gregory T. Brown, Thomas A. Cofino, Yurdaer Nezihi Doganata, Youssef Drissi, Tong-Haing Fin, Moon Ju Kim, Lev Kozakov
-
Publication number: 20040019588Abstract: Disclosed is a method and apparatus for operating a search system for searching a search space containing domains each representing a collection of related documents. The method includes establishing a domain characterization model (DCM) that includes a DCM index containing keywords and, for each keyword, an identification of a domain where the keyword is found. In response to an original search query, the method forms one or a plurality of enhanced search queries in accordance with the DCM for searching a plurality of domains. Each enhanced search query is formed for searching a particular targeted domain and contains the original search query and at least one keyword associated with the particular domain. The search results are clustered and are displayed so as to be organized by the domains that were searched. The top search results returned in response to each enhanced search query belong to one of the targeted domains.Type: ApplicationFiled: July 23, 2002Publication date: January 29, 2004Inventors: Yurdaer N. Doganata, Youssef Drissi, Tong-Haing Fin, Genady Grabarnik, Moon J. Kim, Lev Kozakov, Sheng Ma, Juan Leon Rodriguez