Patents by Inventor Xinnan Yu

Xinnan Yu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20190340534
    Abstract: The present disclosure provides efficient communication techniques for transmission of model updates within a machine learning framework, such as, for example, a federated learning framework in which a high-quality centralized model is trained on training data distributed overt a large number of clients each with unreliable network connections and low computational power. In an example federated learning setting, in each of a plurality of rounds, each client independently updates the model based on its local data and communicates the updated model back to the server, where all the client-side updates are used to update a global model. The present disclosure provides systems and methods that reduce communication costs. In particular, the present disclosure provides at least: structured update approaches in which the model update is restricted to be small and sketched update approaches in which the model update is compressed before sending to the server.
    Type: Application
    Filed: September 7, 2017
    Publication date: November 7, 2019
    Inventors: Hugh Brendan McMahan, Dave Morris Bacon, Jakub Konecny, Xinnan Yu
  • Publication number: 20190294967
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using a neural network that includes a circulant neural network layer. One of the methods includes receiving a layer input for the circulant layer; and processing the layer input to generate a layer output for the circulant layer, wherein processing the layer input comprises computing an activation function, wherein the activation function is dependent on the product of the circulant matrix associated with the circulant layer and the layer input, and wherein computing the activation function comprises performing a circular convolution using a Fast Fourier Transform (FFT).
    Type: Application
    Filed: February 3, 2016
    Publication date: September 26, 2019
    Inventors: Sanjiv Kumar, Xinnan Yu
  • Patent number: 10394777
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for efficiently performing linear projections. In one aspect, a method includes actions for obtaining a plurality of content items from one or more content sources. Additional actions include, extracting a plurality of features from each of the plurality of content items, generating a feature vector for each of the extracted features in order to create a search space, generating a series of element matrices based upon the generated feature vectors, transforming the series of element matrices into a structured matrix such that the transformation preserves one or more relationships associated with each element matrix of the series of element matrices, receiving a search object, searching the enhanced search space based on the received search object, provided one or more links to a content item that are responsive to the search object.
    Type: Grant
    Filed: November 25, 2015
    Date of Patent: August 27, 2019
    Assignee: Google LLC
    Inventors: Xinnan Yu, Sanjiv Kumar, Ruiqi Guo
  • Publication number: 20180114145
    Abstract: Techniques of generating input for a kernel-based machine learning system that uses a kernel to perform classification operations on data involve generating unbiased estimators for gaussian kernels according to a new framework called Structured Orthogonal Random Features (SORF). The unbiased estimator KSORF to the kernel involves a linear transformation matrix WSORF computed using products of a set of pairs of matrices, each pair including an orthogonal matrix and respective diagonal matrix whose elements are real numbers following a specified probability distribution. Typically, the orthogonal matrix is a Walsh-Hadamard matrix, the specified probability distribution is a Rademacher distribution, and there are at least two, usually three, pairs of matrices multiplied together to form the linear transformation matrix WSORF.
    Type: Application
    Filed: October 25, 2017
    Publication date: April 26, 2018
    Inventors: Daniel HOLTMANN-RICE, Sanjiv KUMAR, Xinnan YU, Krzysztof Marcin CHOROMANSKI, Ananda Theertha SURESH
  • Publication number: 20180101570
    Abstract: Implementations provide an efficient system for calculating inner products between high-dimensionality vectors. An example method includes clustering database items represented as vectors, selecting a cluster center for each cluster, and storing the cluster center as an entry in a first layer codebook. The method also includes, for each database item, calculating a residual based on the cluster center for the cluster the database item is assigned to and projecting the residual into subspaces. The method also includes determining, for each of the subspaces, an entry in a second layer codebook for the subspace, and storing the entry in the first layer codebook and the respective entry in the second layer codebook for each of the subspaces as a quantized vector for the database item. The entry can be used to categorize an item represented by a query vector or to provide database items responsive to a query vector.
    Type: Application
    Filed: October 11, 2016
    Publication date: April 12, 2018
    Inventors: Sanjiv KUMAR, David Morris SIMCHA, Ananda Theertha SURESH, Ruiqi GUO, Xinnan YU, Daniel HOLTMANN-RICE
  • Publication number: 20180089590
    Abstract: The present disclosure provides systems and methods for communication efficient distributed mean estimation. In particular, aspects of the present disclosure can be implemented by a system in which a number of vectors reside on a number of different clients, and a centralized server device seeks to estimate the mean of such vectors. According to one aspect of the present disclosure, a client computing device can rotate a vector by a random rotation matrix and then subsequently perform probabilistic quantization on the rotated vector. According to another aspect of the present disclosure, subsequent to quantization but prior to transmission, the client computing can encode the quantized vector according to a variable length coding scheme (e.g., by computing variable length codes).
    Type: Application
    Filed: September 19, 2017
    Publication date: March 29, 2018
    Inventors: Ananda Theertha Suresh, Sanjiv Kumar, Hugh Brendan McMahan, Xinnan Yu
  • Publication number: 20180089587
    Abstract: The present disclosure provides systems and methods for communication efficient distributed mean estimation. In particular, aspects of the present disclosure can be implemented by a system in which a number of vectors reside on a number of different clients, and a centralized server device seeks to estimate the mean of such vectors. According to one aspect of the present disclosure, a client computing device can rotate a vector by a random rotation matrix and then subsequently perform probabilistic quantization on the rotated vector. According to another aspect of the present disclosure, subsequent to quantization but prior to transmission, the client computing can encode the quantized vector according to a variable length coding scheme (e.g., by computing variable length codes).
    Type: Application
    Filed: August 14, 2017
    Publication date: March 29, 2018
    Inventors: Ananda Theertha Suresh, Sanjiv Kumar, Hugh Brendan McMahan, Xinnan Yu
  • Patent number: 9870199
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for augmenting neural networks with an external memory. One of the methods includes receiving a plurality of high-dimensional data items; generating a circulant embedding matrix for the high-dimensional data items, wherein the circulant embedding matrix is a matrix that is fully specified by a single vector; for each high-dimensional data item, generating a compact representation of the high-dimensional data item, comprising computing a product of the circulant embedding matrix and the high dimensional data item by performing a circular convolution of the single vector that fully specifies the circulant embedding matrix and the high dimensional data item using a Fast Fourier Transform (FFT); and generating a compact representation of the high dimensional data item by computing a binary map of the computed product.
    Type: Grant
    Filed: May 12, 2015
    Date of Patent: January 16, 2018
    Assignee: Google LLC
    Inventors: Sanjiv Kumar, Xinnan Yu
  • Publication number: 20170091240
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for efficiently performing linear projections. In one aspect, a method includes actions for obtaining a plurality of content items from one or more content sources. Additional actions include, extracting a plurality of features from each of the plurality of content items, generating a feature vector for each of the extracted features in order to create a search space, generating a series of element matrices based upon the generated feature vectors, transforming the series of element matrices into a structured matrix such that the transformation preserves one or more relationships associated with each element matrix of the series of element matrices, receiving a search object, searching the enhanced search space based on the received search object, provided one or more links to a content item that are responsive to the search object.
    Type: Application
    Filed: November 25, 2015
    Publication date: March 30, 2017
    Inventors: Xinnan Yu, Sanjiv Kumar, Ruiqi Guo
  • Publication number: 20160335053
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for augmenting neural networks with an external memory. One of the methods includes receiving a plurality of high-dimensional data items; generating a circulant embedding matrix for the high-dimensional data items, wherein the circulant embedding matrix is a matrix that is fully specified by a single vector; for each high-dimensional data item, generating a compact representation of the high-dimensional data item, comprising computing a product of the circulant embedding matrix and the high dimensional data item by performing a circular convolution of the single vector that fully specifies the circulant embedding matrix and the high dimensional data item using a Fast Fourier Transform (FFT); and generating a compact representation of the high dimensional data item by computing a binary map of the computed product.
    Type: Application
    Filed: May 12, 2015
    Publication date: November 17, 2016
    Inventors: Sanjiv Kumar, Xinnan Yu
  • Publication number: 20140222783
    Abstract: Systems and methods for automatically determining an improved view for a visual query in a mobile location or object search are provided. In some embodiments, methods for automatically determining an improved view for a visual query in a mobile location or object search system include obtaining at least one data set based on a prior visual query, wherein the at least one data set includes at least a top location or object and one or more other locations or objects; retrieving at least one distinctiveness measurement for one or more locations or objects in the at least one data set; and determining the improved view based on the retrieved at least one distinctiveness measurement.
    Type: Application
    Filed: April 16, 2012
    Publication date: August 7, 2014
    Applicant: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK
    Inventors: Shih-Fu Chang, Xinnan Yu, Rong-Rong Ji, Tongtao Zhang