Patents by Inventor Amaç Herdagdelen

Amaç Herdagdelen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220147543
    Abstract: Systems, methods, and non-transitory computer-readable media can acquire a set of labels associated with a set of content items. Each label in the set of labels can be associated with at least one content item in the set of content items. It can be determined that at least two labels, out of the set of labels, are related. The at least two labels can be determined to be related based on at least one of a co-occurrence metric associated with the at least two labels or a topic similarity metric associated with the at least two labels. One label can be selected, out of the at least two labels, as being representative of the at least two labels.
    Type: Application
    Filed: January 24, 2022
    Publication date: May 12, 2022
    Inventors: Ehud Weinsberg, Bogdan State, Amaç Herdagdelen, Thomas Frederick Dimson, Bai Xiao, Danilo Torres de Sa Resende
  • Publication number: 20190303391
    Abstract: Systems, methods, and non-transitory computer-readable media can acquire a set of labels associated with a set of content items. Each label in the set of labels can be associated with at least one content item in the set of content items. It can be determined that at least two labels, out of the set of labels, are related. The at least two labels can be determined to be related based on at least one of a co-occurrence metric associated with the at least two labels or a topic similarity metric associated with the at least two labels. One label can be selected, out of the at least two labels, as being representative of the at least two labels.
    Type: Application
    Filed: April 9, 2019
    Publication date: October 3, 2019
    Inventors: Ehud Weinsberg, Li Yang, Bogdan State, Amaç Herdagdelen, Thomas Frederick Dimson, Bai Xiao, Danilo Torres de Sa Resende
  • Patent number: 10180935
    Abstract: A system for identifying language(s) for content items is disclosed. The system can identify different languages for content item words segments by identifying segment languages that maximize a probability across the segments. The probability can be a combination of: an author's likelihood for the language identified for the first word; a combination of transition frequencies for selected languages identified for words, the transition frequencies indicating likelihoods that a transition occurred to the selected language from the previous word's language; and a combination of observation probabilities indicating, for a given word in the content item, a likelihood the given word is in the identified language. For an in-vocabulary word, the observation probabilities can be based on learned probability for that word. For an out-of-vocabulary word, the probability can be computed by breaking the word into overlapping n-grams and computing combined learned probabilities that each n-gram is in the given language.
    Type: Grant
    Filed: February 2, 2017
    Date of Patent: January 15, 2019
    Assignee: Facebook, Inc.
    Inventors: Daniel Matthew Merl, Aditya Pal, Stanislav Funiak, Seyoung Park, Fei Huang, Amac Herdagdelen
  • Publication number: 20180293611
    Abstract: A primary online system infers interests for its users based on interest information in a secondary online system. Users that have user profiles in both the primary online system and the secondary online system are identified, and those associated with a target interest in the secondary online system are selected as part of a training group of that is used to generate an interest inference model that associates information in the training group's user profiles in the primary online system with the target interest. The interest inference model is applied to an input group of users in the primary online system to identify a seed group of users for whom the target interest can be inferred. The primary online system can then target content related to the target interest to an expanded group of users generated based on the seed group.
    Type: Application
    Filed: April 7, 2017
    Publication date: October 11, 2018
    Inventors: Sagar Chordia, Kai Ren, Adiitya Pal, Amac Herdagdelen, Tian Wang
  • Publication number: 20180189259
    Abstract: A system for identifying language(s) for content items is disclosed. The system can identify different languages for content item words segments by identifying segment languages that maximize a probability across the segments. The probability can be a combination of: an author's likelihood for the language identified for the first word; a combination of transition frequencies for selected languages identified for words, the transition frequencies indicating likelihoods that a transition occurred to the selected language from the previous word's language; and a combination of observation probabilities indicating, for a given word in the content item, a likelihood the given word is in the identified language. For an in-vocabulary word, the observation probabilities can be based on learned probability for that word. For an out-of-vocabulary word, the probability can be computed by breaking the word into overlapping n-grams and computing combined learned probabilities that each n-gram is in the given language.
    Type: Application
    Filed: February 2, 2017
    Publication date: July 5, 2018
    Inventors: Daniel Matthew Merl, Aditya Pal, Stanislav Funiak, Seyoung Park, Fei Huang, Amac Herdagdelen
  • Patent number: 10013417
    Abstract: Technology for media item and user language classification is disclosed. Media item classification may use models for associating language identifiers or probability distributions for multiple languages with linguistic content. User language classification may define user language models for attributing to users indications of languages they speak read, and/or write. The text classifications and user classifications may interact because the probability that given text is in a particular language may depend on a determined likelihood the user who produced the text speaks that language, or conversely, a user interacting with text in a particular language may increase the likelihood they understand that language. Some embodiments use language-tagged social media content to train n-gram classifiers for use with other social media content.
    Type: Grant
    Filed: July 17, 2017
    Date of Patent: July 3, 2018
    Assignee: FACEBOOK, INC.
    Inventors: Amac Herdagdelen, Bradley Ray Green
  • Patent number: 10002131
    Abstract: Technology for media item and user language classification is disclosed. Media item classification may use models for associating language identifiers or probability distributions for multiple languages with linguistic content. User language classification may define user language models for attributing to users indications of languages they speak read, and/or write. The text classifications and user classifications may interact because the probability that given text is in a particular language may depend on a determined likelihood the user who produced the text speaks that language, or conversely, a user interacting with text in a particular language may increase the likelihood they understand that language. Some embodiments use language-tagged social media content to train n-gram classifiers for use with other social media content.
    Type: Grant
    Filed: February 28, 2017
    Date of Patent: June 19, 2018
    Assignee: FACEBOOK, INC.
    Inventors: Amac Herdagdelen, Bradley Ray Green
  • Publication number: 20170315988
    Abstract: Technology for media item and user language classification is disclosed. Media item classification may use models for associating language identifiers or probability distributions for multiple languages with linguistic content. User language classification may define user language models for attributing to users indications of languages they speak read, and/or write. The text classifications and user classifications may interact because the probability that given text is in a particular language may depend on a determined likelihood the user who produced the text speaks that language, or conversely, a user interacting with text in a particular language may increase the likelihood they understand that language. Some embodiments use language-tagged social media content to train n-gram classifiers for use with other social media content.
    Type: Application
    Filed: July 17, 2017
    Publication date: November 2, 2017
    Inventors: Amac Herdagdelen, Bradley Ray Green
  • Publication number: 20170270102
    Abstract: Technology for media item and user language classification is disclosed. Media item classification may use models for associating language identifiers or probability distributions for multiple languages with linguistic content. User language classification may define user language models for attributing to users indications of languages they speak read, and/or write. The text classifications and user classifications may interact because the probability that given text is in a particular language may depend on a determined likelihood the user who produced the text speaks that language, or conversely, a user interacting with text in a particular language may increase the likelihood they understand that language. Some embodiments use language-tagged social media content to train n-gram classifiers for use with other social media content.
    Type: Application
    Filed: February 28, 2017
    Publication date: September 21, 2017
    Inventors: Amac Herdagdelen, Bradley Ray Green
  • Patent number: 9740687
    Abstract: Technology for media item and user language classification is disclosed. Media item classification may use models for associating language identifiers or probability distributions for multiple languages with linguistic content. User language classification may define user language models for attributing to users indications of languages they speak read, and/or write. The text classifications and user classifications may interact because the probability that given text is in a particular language may depend on a determined likelihood the user who produced the text speaks that language, or conversely, a user interacting with text in a particular language may increase the likelihood they understand that language. Some embodiments use language-tagged social media content to train n-gram classifiers for use with other social media content.
    Type: Grant
    Filed: June 11, 2014
    Date of Patent: August 22, 2017
    Assignee: Facebook, Inc.
    Inventors: Amac Herdagdelen, Bradley Ray Green
  • Publication number: 20170220677
    Abstract: In one embodiment, a method includes accessing a plurality of communications, each communication being associated with a particular content item and including a text of the communication; extracting, for each of the communications, quotations from the text of the communication; determining, for each extracted quotation, partitions of the quotation; grouping the extracted quotations into clusters based on a respective degree of similarity among their respective partitions; calculating a cluster-score for each cluster based on a frequency of occurrence of partitions of quotations in the cluster in the communications; and generating a quotations-module comprising representative quotations, each representative quotation being a quotation from a cluster having a cluster-score greater than a threshold cluster-score.
    Type: Application
    Filed: February 3, 2016
    Publication date: August 3, 2017
    Inventors: Rousseau Newaz Kazi, Mark Andrew Rich, Christina Joan Sauper, Amaç Herdagdelen, Soorya Vamsi Mohan Tanikella, Brett Matthew Westervelt, Maykel Andreas Louisa Jozef Anna Loomans, Adam Eugene Bussing, Shuyi Zheng
  • Publication number: 20170220578
    Abstract: In one embodiment, a method includes accessing a plurality of communications, each communication being associated with a particular content item and including a text of the communication; calculating, for each of the communications, sentiment-scores corresponding to sentiments, wherein each sentiment-score is based on a degree to which n-grams of the text of the communication match sentiment-words associated with the sentiments; determining, for each of the communications, an overall sentiment for the communication based on the calculated sentiment-scores for the communication; calculating sentiment levels for the particular content item corresponding sentiments, each sentiment level being based on a total number of communications determined to have the overall sentiment of the sentiment level; and generating a sentiments-module including sentiment-representations corresponding to overall sentiments having sentiment levels greater than a threshold sentiment level.
    Type: Application
    Filed: February 3, 2016
    Publication date: August 3, 2017
    Inventors: Rousseau Newaz Kazi, Mark Andrew Rich, Christina Joan Sauper, Amaç Herdagdelen, Soorya Vamsi Mohan Tanikella, Brett Matthew Westervelt, Maykel Andreas Louisa Jozef Anna Loomans, Adam Eugene Bussing, Shuyi Zheng
  • Publication number: 20170220579
    Abstract: In one embodiment, a method includes accessing a plurality of communications, each communication being associated with a particular content item and including a text of the communication; extracting, for each of the communications, n-grams from the text of the communication; identifying mention-terms from the extracted n-grams, each mention-term being a noun-phrase; calculating a term-score for each mention-term based on a frequency of occurrence of the mention-term in the communications; and generating a mentions-module including mentions, each mention including a mention-term having a term-score greater than a threshold term-score and text from communications comprising the mention-term.
    Type: Application
    Filed: February 3, 2016
    Publication date: August 3, 2017
    Inventors: Rousseau Newaz Kazi, Mark Andrew Rich, Christina Joan Sauper, Amaç Herdagdelen, Soorya Vamsi Mohan Tanikella, Brett Matthew Westervelt, Maykel Andreas Louisa Jozef Anna Loomans, Adam Eugene Bussing, Shuyi Zheng
  • Publication number: 20170220652
    Abstract: In one embodiment, a method includes receiving, from a client system of a first user, a request associated with a particular content item; identifying communications authored by one or more users, each identified communication being associated with the particular content item; generating one or more search-results modules related to the particular content item, each search-results module being of a particular module type, wherein each search-results module includes information from a subset of the identified communications, the information corresponding to the particular module type of the search-results module, and wherein a number of communications in the subset of the identified communications including each search-results module is greater than a module-specific threshold number of communications; and sending, to the client system, a search-results interface comprising one or more of the search-results modules.
    Type: Application
    Filed: February 3, 2016
    Publication date: August 3, 2017
    Inventors: Rousseau Newaz Kazi, Mark Andrew Rich, Christina Joan Sauper, Amaç Herdagdelen, Soorya Vamsi Mohan Tanikella, Brett Matthew Westervelt, Maykel Andreas Louisa Jozef Anna Loomans, Adam Eugene Bussing, Shuyi Zheng
  • Publication number: 20170220577
    Abstract: Systems, methods, and non-transitory computer-readable media can determine one or more respective topics of interest for at least some users of a social networking system. At least some of the topics can be propagated to at least a first user, wherein the propagated topics were determined to be of interest to users that follow the first user in the social networking system. At least one topic from the propagated topics for which the first user is a topical authority is determined.
    Type: Application
    Filed: January 29, 2016
    Publication date: August 3, 2017
    Inventors: Aditya Pal, Amaç Herdagdelen, Sourav Chatterji, Sumit Taank, Deepayan Chakrabarti
  • Publication number: 20170052954
    Abstract: Systems, methods, and non-transitory computer readable media configured to acquire data associated with a content item, the data associated with the content item including contextual information. The data associated with the content item can be provided to a model trained by machine learning. A set of hashtags associated with the content item can be determined based on the model.
    Type: Application
    Filed: August 18, 2015
    Publication date: February 23, 2017
    Inventors: Bogdan State, Amaç Herdagdelen, Maxime Boucher, Ehud Weinsberg
  • Publication number: 20170053013
    Abstract: Systems, methods, and non-transitory computer-readable media can acquire a set of labels associated with a set of content items. Each label in the set of labels can be associated with at least one content item in the set of content items. It can be determined that at least two labels, out of the set of labels, are related. The at least two labels can be determined to be related based on at least one of a co-occurrence metric associated with the at least two labels or a topic similarity metric associated with the at least two labels. One label can be selected, out of the at least two labels, as being representative of the at least two labels.
    Type: Application
    Filed: August 18, 2015
    Publication date: February 23, 2017
    Inventors: Ehud Weinsberg, Bogdan State, Amaç Herdagdelen, Thomas Frederick Dimson, Bai Xiao, Danilo Torres de Sa Resende
  • Publication number: 20170046630
    Abstract: Systems and methods are provided for classifying text based on language using one or more computer servers and storage devices. A computer-implemented method includes receiving a training set of elements, each element in the training set being assigned to one of a plurality of categories and having one of a plurality of content profiles associated therewith; receiving a population set of elements, each element in the population set having one of the plurality of content profiles associated therewith; and calculating using at least one of a stacked regression algorithm, a bias formula algorithm, a noise elimination algorithm, and an ensemble method consisting of a plurality of algorithmic methods the results of which are averaged, based on the content profiles associated with and the categories assigned to elements in the training set and the content profiles associated with the elements of the population set, a distribution of elements of the population set over the categories.
    Type: Application
    Filed: October 28, 2016
    Publication date: February 16, 2017
    Inventors: Aykut Firat, Mitchell Brooks, Christopher Bingham, Amac Herdagdelen, Gary King
  • Patent number: 9483544
    Abstract: Systems and methods are provided for classifying text based on language using one or more computer servers and storage devices. A computer-implemented method includes receiving a training set of elements, each element in the training set being assigned to one of a plurality of categories and having one of a plurality of content profiles associated therewith; receiving a population set of elements, each element in the population set having one of the plurality of content profiles associated therewith; and calculating using at least one of a stacked regression algorithm, a bias formula algorithm, a noise elimination algorithm, and an ensemble method consisting of a plurality of algorithmic methods the results of which are averaged, based on the content profiles associated with and the categories assigned to elements in the training set and the content profiles associated with the elements of the population set, a distribution of elements of the population set over the categories.
    Type: Grant
    Filed: March 14, 2013
    Date of Patent: November 1, 2016
    Assignee: Crimson Hexagon, Inc.
    Inventors: Aykut Firat, Mitchell Brooks, Christopher Bingham, Amac Herdagdelen, Gary King
  • Patent number: 9355091
    Abstract: Systems and methods are provided for classifying text based on language using one or more computer servers and storage devices. In general, the systems and methods can include a language classification module for classifying text of an input data set using the output of a training module. In an exemplary embodiment, a bootstrapping step feeds the output of the language classification module back into the training module to increase the accuracy of the language classification module. By iterating the language classification and training modules with input data having certain features, a user can tailor the language classification module for use with text having those or similar features.
    Type: Grant
    Filed: March 13, 2013
    Date of Patent: May 31, 2016
    Assignee: Crimson Hexagon, Inc.
    Inventors: Amac Herdagdelen, Aykut Firat, Christopher Bingham