Patents by Inventor Aiyou Chen

Aiyou Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230036170
    Abstract: Techniques for preparing datasets for geo experiments and improving accuracy of geo experiments are presented herein. The system can access a dataset of a plurality of geographic pairs. Additionally, the system can calculate a first outcome estimate based on a difference in response data and a difference in input data for a first geographic pair. Moreover, the system can calculate a plurality of experimental uncertainty estimates associated with the plurality of geographic pairs during an experimental time interval. The system can access historical data associated with the plurality of geographic pairs. Furthermore, the system can determine a beta value and a trim rate that reduces a sum of the plurality estimates. Subsequently, the system can remove, based on the first outcome estimate and the beta value, the first geographic pair from the plurality of geographic pairs to generate the first subset of geographic pairs.
    Type: Application
    Filed: August 30, 2022
    Publication date: February 2, 2023
    Inventors: Aiyou Chen, Timothy Chun-Wai Au
  • Patent number: 11514274
    Abstract: Systems, methods and computer-readable storage media utilized to prepare datasets for geo experiments. One method includes receiving one or more input parameters. The method further includes extracting, from the data, training data. The method further includes calculating a difference in input data and a difference in response data of the training data. The method further includes determining a first plurality of geographic pairs. The method further includes extracting, from the data, evaluation data. The method further includes separating each geographic pair of the first plurality of geographic pairs into a treatment region or a control region for a plurality of simulations of a plurality of different simulation subsets for each of a plurality of different subsets of geographic pairs. The method further includes calculating a plurality of uncertainty estimates. The method further includes selecting a first subset of geographic pairs and providing the selected subset of geographic pairs.
    Type: Grant
    Filed: March 30, 2020
    Date of Patent: November 29, 2022
    Assignee: GOOGLE LLC
    Inventors: Aiyou Chen, Timothy C. Au, Nicolas Remy, Kevin Benac
  • Publication number: 20210312221
    Abstract: Systems, methods and computer-readable storage media utilized to prepare datasets for geo experiments. One method includes receiving one or more input parameters. The method further includes extracting, from the data, training data. The method further includes calculating a difference in input data and a difference in response data of the training data. The method further includes determining a first plurality of geographic pairs. The method further includes extracting, from the data, evaluation data. The method further includes separating each geographic pair of the first plurality of geographic pairs into a treatment region or a control region for a plurality of simulations of a plurality of different simulation subsets for each of a plurality of different subsets of geographic pairs. The method further includes calculating a plurality of uncertainty estimates. The method further includes selecting a first subset of geographic pairs and providing the selected subset of geographic pairs.
    Type: Application
    Filed: March 30, 2020
    Publication date: October 7, 2021
    Applicant: Google LLC
    Inventors: Aiyou Chen, Timothy C. Au, Nicolas Remy, Kevin Benac
  • Publication number: 20210312497
    Abstract: Systems, methods and computer-readable storage media utilized to prepare experimental datasets for experimental analysis systems. One method includes identifying, by one or more processing circuits, a dataset of a plurality of geographic pairs associated with a geo experiment. The method further includes calculating, by the one or more processing circuits, a difference in input data and a difference in response data between the first geographic region and the second geographic region of each geographic pair. The method further includes calculating, by the one or more processing circuits, a plurality of outcome estimates. The method further includes selecting, by the one or more processing circuits, a first subset of geographic pairs of the plurality of different subsets of geographic pairs based a first outcome estimate of the plurality of outcome estimates that is about a prespecified value on the outcome estimates and providing the selected subset of geographic pairs.
    Type: Application
    Filed: March 30, 2020
    Publication date: October 7, 2021
    Applicant: Google LLC
    Inventors: Aiyou Chen, Timothy C. Au
  • Patent number: 10740602
    Abstract: Systems and methods for assigning word fragments to lines of text in optical character recognition (OCR) extracted data can include at least one processor obtaining a plurality of word fragments from OCR generated data associated with an image. The at least one processor can determine vertical coordinates of each of the word fragments in the image. The at least one processor can cluster the plurality of word fragments into one or more clusters of word fragments based on the vertical coordinates of the plurality of word fragments. The at least one processor can assign each word fragment of a respective cluster to a corresponding text line based on the clustering.
    Type: Grant
    Filed: April 18, 2018
    Date of Patent: August 11, 2020
    Assignee: Google LLC
    Inventors: Ivan Ordonez, Swaminathan Krishnamurthy, David Paul, Tushar Udeshi, Aiyou Chen
  • Publication number: 20190325211
    Abstract: Systems and methods for assigning word fragments to lines of text in optical character recognition (OCR) extracted data can include at least one processor obtaining a plurality of word fragments from OCR generated data associated with an image. The at least one processor can determine vertical coordinates of each of the word fragments in the image. The at least one processor can cluster the plurality of word fragments into one or more clusters of word fragments based on the vertical coordinates of the plurality of word fragments. The at least one processor can assign each word fragment of a respective cluster to a corresponding text line based on the clustering.
    Type: Application
    Filed: April 18, 2018
    Publication date: October 24, 2019
    Applicant: GOOGLE LLC
    Inventors: Ivan Ordonez, Swaminathan Krishnamurthy, David Paul, Tushar Udeshi, Aiyou Chen
  • Patent number: 10445388
    Abstract: Systems, methods, and computer-readable storage media that may be used to generate causal models and calculate a selection bias in mixed media. In some embodiments, the selection bias calculation is in search sponsored content in the context of mixed media modeling. In some embodiments, a method for search bias correction is based on the back-door criterion from causal inference.
    Type: Grant
    Filed: September 19, 2017
    Date of Patent: October 15, 2019
    Assignee: Google LLC
    Inventors: David Chan, Yueqing Wang, Aiyou Chen, James Koehler, Yuxue Jin, Michael Perry, Yunting Sun
  • Publication number: 20190087497
    Abstract: Systems, methods, and computer-readable storage media that may be used to generate causal models and calculate a selection bias in mixed media. In some embodiments, the selection bias calculation is in search sponsored content in the context of mixed media modeling. In some embodiments, a method for search bias correction is based on the back-door criterion from causal inference.
    Type: Application
    Filed: September 19, 2017
    Publication date: March 21, 2019
    Applicant: Google Inc.
    Inventors: David Chan, Yueqing Wang, Aiyou Chen, James Koehler, Yuxue Jin, Michael Perry, Yunting Sun
  • Patent number: 10223728
    Abstract: Systems and methods of directed item consumption recommendations are disclosed which include generating, with a server, empirical transition matrix data that includes row data for a first item and column data for a second item, and an entry in the empirical transition matrix data for a number of users that acquire the second item after the first item, generating, with the server, metadata transition matrix data by partitioning items for each item metadata type, setting a uniform transition probability for all items in the partition, and summing the uniform transition probabilities across all metadata types, generating, with the server, transition probability matrix data by multiplying the metadata transition matrix data with a weight parameter, adding the result to the empirical transition matrix data, and normalizing each row, and providing item recommendations to a user computing device communicatively coupled to the server according to the generated transition probability matrix data.
    Type: Grant
    Filed: December 9, 2014
    Date of Patent: March 5, 2019
    Assignee: Google LLC
    Inventors: Aiyou Chen, Jeffrey David Oldham
  • Publication number: 20160162975
    Abstract: Systems and methods of directed item consumption recommendations are disclosed which include generating, with a server, empirical transition matrix data that includes row data for a first item and column data for a second item, and an entry in the empirical transition matrix data for a number of users that acquire the second item after the first item, generating, with the server, metadata transition matrix data by partitioning items for each item metadata type, setting a uniform transition probability for all items in the partition, and summing the uniform transition probabilities across all metadata types, generating, with the server, transition probability matrix data by multiplying the metadata transition matrix data with a weight parameter, adding the result to the empirical transition matrix data, and normalizing each row, and providing item recommendations to a user computing device communicatively coupled to the server according to the generated transition probability matrix data.
    Type: Application
    Filed: December 9, 2014
    Publication date: June 9, 2016
    Inventors: Aiyou Chen, Jeffrey David Oldham
  • Publication number: 20150363502
    Abstract: Systems and techniques are provided for optimizing personalized recommendations with longitudinal data and a future objective. An identifier may be received for content items. A user content item history including a list identifying a previously acquired content item may be received. Content item metadata may be received including a correlation between the previously acquired content item and a content item for which an identifier was received, and a correlation between a content item for which an identifier was received and fulfillment of a future objective. A joint probability may be determined for each content item based on the user content item history and the content item metadata, including the probability that the content item will be acquired by the user after being recommended to the user and that a future objective will be fulfilled after the content item is acquired by the user.
    Type: Application
    Filed: June 16, 2014
    Publication date: December 17, 2015
    Inventors: Aiyou Chen, James Robert Koehler, Nicolas Remy, Makoto Uchida
  • Patent number: 9069962
    Abstract: A method and computer product are presented for identifying Internet worm propagation based upon changes in packet arrival rates at a network connection. First, unsolicited (i.e., packets that were not requested by the receiver) traffic is separated from solicited traffic at the network connection. The unsolicited traffic arrival patterns are monitored and analyzed for any changes. Once changes in the unsolicited traffic arrival patterns are detected, the changes are mathematically analyzed to detect growth trends. The presence of growth trends that follow certain key characteristics indicate whether the changes are due to worm propagation.
    Type: Grant
    Filed: December 4, 2013
    Date of Patent: June 30, 2015
    Assignee: Alcatel Lucent
    Inventors: Tian Bu, Aiyou Chen, Scott Alan Vander Wiel, Thomas Woo
  • Patent number: 9047362
    Abstract: In one aspect, a processing device of an information processing system is operative to perform high-dimensional stratified sampling of a database comprising a plurality of records arranged in overlapping sub-groups. For a given record, the processing device determines which of the sub-groups the given record is associated with, and for each of the sub-groups associated with the given record, checks if a sampling rate of the sub-group is less than a specified sampling rate. If the sampling rate of each of the sub-groups is less than the specified sampling rate, the processing device samples the given record, and otherwise does not sample the given record. The determine, check and sample operations are repeated for additional records, and samples resulting from the sample operations are processed to generate information characterizing the database.
    Type: Grant
    Filed: October 15, 2013
    Date of Patent: June 2, 2015
    Assignee: Alcatel Lucent
    Inventors: Aiyou Chen, Ming Xiong
  • Patent number: 8931088
    Abstract: In one embodiment, a counting method of the invention uses an adaptive sketching-update process to compress an unknown cardinality into a counter value that counts the number of binary ones in a hashed bitmap vector. The sketching-update process is probabilistic in nature and uses bit-flip probabilities that are adaptively decreased as the counter value increases. Parameters of the sketching-update process are selected so that the relative error of cardinality estimates obtained based on the counter values is relatively small and substantially constant over a relatively wide range of cardinalities, e.g., from one to about one million. Due to the latter property, the counting method can advantageously be implemented in the form of embedded software that relies on a relatively small, fixed amount of memory.
    Type: Grant
    Filed: March 26, 2010
    Date of Patent: January 6, 2015
    Assignee: Alcatel Lucent
    Inventors: Aiyou Chen, Jin Cao, Lawrence E. Menten
  • Patent number: 8922433
    Abstract: The present invention provides a method of unsupervised learning and location for tracking users in a wireless communication system. One embodiment of the method includes forming a signal map of a geographic area using unlabeled values of one or more signals so that the signal map relates locations in the geographic area to values of the signal(s).
    Type: Grant
    Filed: August 27, 2010
    Date of Patent: December 30, 2014
    Assignee: Alcatel Lucent
    Inventors: Philip Whiting, Aiyou Chen
  • Publication number: 20140181978
    Abstract: A method and computer product are presented for identifying Internet worm propagation based upon changes in packet arrival rates at a network connection. First, unsolicited (i.e., packets that were not requested by the receiver) traffic is separated from solicited traffic at the network connection. The unsolicited traffic arrival patterns are monitored and analyzed for any changes. Once changes in the unsolicited traffic arrival patterns are detected, the changes are mathematically analyzed to detect growth trends. The presence of growth trends that follow certain key characteristics indicate whether the changes are due to worm propagation.
    Type: Application
    Filed: December 4, 2013
    Publication date: June 26, 2014
    Applicant: ALCATEL-LUCENT USA INC.
    Inventors: Tian BU, Aiyou CHEN, Scott Alan VANDER WIEL, Thomas WOO
  • Patent number: 8719267
    Abstract: A processing device of an information processing system is operative to obtain a plurality of records, documents, web pages or other data objects, and to construct a binary tree using a bipartition procedure in which subsets of the data objects are associated with respective nodes of the tree. Evaluation of a designated modularity for a given one of the nodes of the tree is used as a stopping criterion to prevent further partitioning of that node and to indicate designation of that node as a leaf node of the tree. The resulting leaf nodes of the tree provide a non-overlapping partitioning of the plurality of data objects. The processing device is further operative to perform a neighborhood search on the tree to identify pairs of the plurality of data objects that match the same entity, and to store an indication of the matching pairs of data objects.
    Type: Grant
    Filed: April 19, 2010
    Date of Patent: May 6, 2014
    Assignee: Alcatel Lucent
    Inventors: Aiyou Chen, Liangcai Shu, Ming Xiong
  • Patent number: 8666946
    Abstract: A method and apparatus are provided for incrementally tracking quantiles in the presence of multiple record types. A method for performing incremental quantile tracking includes receiving a first data record of a first record type and a second data record of a second record type, and updating a quantile probability for a quantile value, based on the first record type of the first data record and the second record type of the second data record, to obtain a new quantile probability for the quantile value.
    Type: Grant
    Filed: August 24, 2009
    Date of Patent: March 4, 2014
    Assignee: Alcatel Lucent
    Inventors: Tian Bu, Jin Cao, Aiyou Chen, Li Li
  • Publication number: 20140040268
    Abstract: In one aspect, a processing device of an information processing system is operative to perform high-dimensional stratified sampling of a database comprising a plurality of records arranged in overlapping sub-groups. For a given record, the processing device determines which of the sub-groups the given record is associated with, and for each of the sub-groups associated with the given record, checks if a sampling rate of the sub-group is less than a specified sampling rate. If the sampling rate of each of the sub-groups is less than the specified sampling rate, the processing device samples the given record, and otherwise does not sample the given record. The determine, check and sample operations are repeated for additional records, and samples resulting from the sample operations are processed to generate information characterizing the database.
    Type: Application
    Filed: October 15, 2013
    Publication date: February 6, 2014
    Applicant: ALCATEL-LUCENT USA, INC.
    Inventors: Aiyou Chen, MING XIONG
  • Patent number: 8639692
    Abstract: In one aspect, a processing device of an information processing system is operative to perform high-dimensional stratified sampling of a database comprising a plurality of records arranged in overlapping sub-groups. For a given record, the processing device determines which of the sub-groups the given record is associated with, and for each of the sub-groups associated with the given record, checks if a sampling rate of the sub-group is less than a specified sampling rate. If the sampling rate of each of the sub-groups is less than the specified sampling rate, the processing device samples the given record, and otherwise does not sample the given record. The determine, check and sample operations are repeated for additional records, and samples resulting from the sample operations are processed to generate information characterizing the database.
    Type: Grant
    Filed: June 28, 2010
    Date of Patent: January 28, 2014
    Assignee: Alcatel Lucent
    Inventors: Aiyou Chen, Ming Xiong