Patents by Inventor Go Irie
Go Irie has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12277160Abstract: A search apparatus for searching media data for a target region that matches query data includes a first feature extraction unit configured to extract a first feature vector from the query data using a first trained neural network; a second feature extraction unit configured to obtain a first region from the media data and extract a second feature vector from the first region using a second trained neural network; a localization unit configured to determine a candidate for the target region using a third trained neural network, based on the first feature vector, the second feature vector, and the first region or a location of the first region; and a control unit configured to repeat the operations of the second feature extraction unit and the localization unit until a predetermined condition is satisfied, by using the determined candidate for the target region as the first region to be used by the second feature extraction unit.Type: GrantFiled: September 10, 2019Date of Patent: April 15, 2025Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Krishna Onkar, Go Irie, Xiaomeng Wu, Takahito Kawanishi, Kunio Kashino
-
Patent number: 12131491Abstract: An acquiring unit of a depth estimation apparatus acquires an input image. In addition, a depth map generating unit inputs the input image acquired by the acquiring unit into a depth estimator for generating, from an image, a depth map in which a depth of a space that appears on the image is imparted to each pixel of the image, and generates an estimated depth map that represents a depth map corresponding to the input image. The depth estimator is a model having been learned in advance so as to reduce, with respect to each error between a depth of the estimated depth map and a depth of a correct-answer depth map that presents the depth map of a correct answer, a value of a loss function set such that a degree of increase of a loss value with respect to a pixel at which the error is larger than a threshold is smaller than a degree of increase of a loss value with respect to a pixel at which the error is equal to or smaller than the threshold.Type: GrantFiled: May 10, 2019Date of Patent: October 29, 2024Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go Irie, Takahito Kawanishi, Kunio Kashino
-
Publication number: 20240169262Abstract: A learning device includes: a data input unit configured to receive first data which is a learning target, second data for identifying the first data, and second past data that is used as data for identifying the first data during past learning and relates to learning content to be preserved; a combined data generation unit configured to generate combined data by combining the first data and the second data; and a parameter updating unit configured to update a parameter of a machine learning model based on features of the second past data and the combined data obtained by inputting the combined data and the second past data to the machine learning model.Type: ApplicationFiled: March 23, 2021Publication date: May 23, 2024Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Takashi SHIBATA, Go IRIE, Daiki IKAMI, Yu MITSUZUMI
-
Patent number: 11816882Abstract: An image identification device can be trained to identify classes with high accuracy even in cases with a small number of learning images.Type: GrantFiled: July 17, 2019Date of Patent: November 14, 2023Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go Irie, Yu Mizutsumi
-
Patent number: 11748619Abstract: The purpose of the present invention is to enable learning of a neural network for extracting features of images having high robustness from an undiscriminating image region while minimizing the number of parameters of a pooling layer. A parameter learning unit 130 learns parameters of each layer in a convolutional neural network configured by including a fully convolutional layer for performing convolution of an input image to output a feature tensor of the input image, a weighting matrix estimation layer for estimating a weighting matrix indicating a weighting of each element of the feature tensor, and a pooling layer for extracting a feature vector of the input image based on the feature tensor and the weighting matrix.Type: GrantFiled: June 14, 2019Date of Patent: September 5, 2023Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Xiaomeng Wu, Go Irie, Kaoru Hiramatsu, Kunio Kashino
-
Patent number: 11727584Abstract: It is possible to receive a point cloud as input and perform shape completion with high accuracy. A shape completion unit inputs an input point cloud and a class identification feature output by a class identification unit to a generator that is learned in advance and generates a shape completion point cloud that is to complete a point cloud and is a set of three-dimensional points by receiving, as input, the point cloud and the class identification feature, gaining an integration result obtained by integrating a global feature that is a global feature based on local features extracted from respective points of the point cloud with the class identification feature, and convoluting the integration result, and outputs the shape completion point cloud that is to complete the input point cloud.Type: GrantFiled: September 12, 2019Date of Patent: August 15, 2023Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Hidehisa Nagano, Go Irie, Seiya Ito, Kazuhiko Sumi
-
Publication number: 20230169670Abstract: A depth estimation method using a depth estimator trained to output a depth map of a depth provided to each pixel of an input image, in which: the depth estimator includes a pair of a first convolutional layer and a second convolutional layer coupled to each other and configured to, when having received, as input, a tensor obtained by applying predetermined conversion to an input image, apply a two-dimensional convolution operation to the tensor and output the tensor to which the two-dimensional convolution operation is applied; the first convolutional layer is a convolutional layer including a first kernel of a shape having lengths in a first direction and a second direction, the first direction being one of a vertical direction and a horizontal direction, the second direction being different from the first direction, the length in the second direction being longer than the length in the first direction; and the second convolutional layer is a convolutional layer including a second kernel of a shape having lType: ApplicationFiled: April 30, 2020Publication date: June 1, 2023Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go IRIE, Daiki IKAMI, Takahito KAWANISHI, Kunio KASHINO
-
Patent number: 11615132Abstract: Low-dimensional feature values with which semantic factors of content are ascertained are generated from relevance between sets of two types of content. Based on a relation indicator indicating a pair of groups indicating which groups are related to first types of content groups among second types of content groups, an initial feature value extracting unit 11 extracts initial feature values of the first type of content and the second type of content. A content pair selecting unit 12 selects a content pair by selecting one first type of content and one second type of content from each pair of groups indicated by the relation indicator. A feature value conversion function generating unit 13 generates feature conversion functions 31 of converting the initial feature values into low-dimensional feature values based on the content pair selected from each pair of groups.Type: GrantFiled: July 8, 2019Date of Patent: March 28, 2023Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go Irie, Kaoru Hiramatsu, Kunio Kashino, Kiyoharu Aizawa
-
Patent number: 11520837Abstract: Clustering can be performed using a self-expression matrix in which noise is suppressed. A self-expression matrix is calculated that minimizes an objective function that is for obtaining, from among matrices included in a predetermined matrix set, a self-expression matrix whose elements are linear weights when data points in a data set are expressed by linear combinations of points, the objective function being represented by a term for obtaining the residual between data points in the data set and data points expressed by linear combinations of points using the self-expression matrix, a first regularization term that is multiplied by a predetermined weight and is for reducing linear weights of the data points that have a large Euclidean norm in the self-expression matrix, and a second regularization term for the self-expression matrix. A similarity matrix defined by the calculated self-expression matrix is then calculated.Type: GrantFiled: July 26, 2019Date of Patent: December 6, 2022Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Masataka Yamaguchi, Go Irie, Kaoru Hiramatsu, Kunio Kashino
-
Publication number: 20220221581Abstract: In a depth estimation device, a generation unit generates a predetermined attractive sound in a space to be measured. A sound pickup unit picks up an acoustic signal for a predetermined time period corresponding to a time period before and after a time of generation of the attractive sound. An estimation unit extracts a feature representing time-frequency information obtained through analysis of the acoustic signal, on the basis of the acoustic signal, and inputs the extracted feature representing the time-frequency information to a depth estimator and generates an estimated depth map for the space to be measured, the depth estimator being composed of one or more convolution operations and being learned so as to output an estimated depth map, in which a depth is assigned to each of pixels of an image representing the space to be measured, when a feature representing the time-frequency information is input.Type: ApplicationFiled: May 21, 2019Publication date: July 14, 2022Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go IRIE, Takahito KAWANISHI, Kunio KASHINO
-
Publication number: 20220215567Abstract: An acquiring unit of a depth estimation apparatus acquires an input image. In addition, a depth map generating unit inputs the input image acquired by the acquiring unit into a depth estimator for generating, from an image, a depth map in which a depth of a space that appears on the image is imparted to each pixel of the image, and generates an estimated depth map that represents a depth map corresponding to the input image. The depth estimator is a model having been learned in advance so as to reduce, with respect to each error between a depth of the estimated depth map and a depth of a correct-answer depth map that presents the depth map of a correct answer, a value of a loss function set such that a degree of increase of a loss value with respect to a pixel at which the error is larger than a threshold is smaller than a degree of increase of a loss value with respect to a pixel at which the error is equal to or smaller than the threshold.Type: ApplicationFiled: May 10, 2019Publication date: July 7, 2022Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go IRIE, Takahito KAWANISHI, Kunio KASHINO
-
Publication number: 20220188345Abstract: A search apparatus for searching media data for a target region that matches query data includes a first feature extraction unit configured to extract a first feature vector from the query data using a first trained neural network; a second feature extraction unit configured to obtain a first region from the media data and extract a second feature vector from the first region using a second trained neural network; a localization unit configured to determine a candidate for the target region using a third trained neural network, based on the first feature vector, the second feature vector, and the first region or a location of the first region; and a control unit configured to repeat the operations of the second feature extraction unit and the localization unit until a predetermined condition is satisfied, by using the determined candidate for the target region as the first region to be used by the second feature extraction unit.Type: ApplicationFiled: September 10, 2019Publication date: June 16, 2022Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Krishna ONKAR, Go IRIE, Xiaomeng WU, Takahito KAWANISHI, Kunio KASHINO
-
Publication number: 20220005212Abstract: It is possible to receive a point cloud as input and perform shape completion with high accuracy. A shape completion unit inputs an input point cloud and a class identification feature output by a class identification unit to a generator that is learned in advance and generates a shape completion point cloud that is to complete a point cloud and is a set of three-dimensional points by receiving, as input, the point cloud and the class identification feature, gaining an integration result obtained by integrating a global feature that is a global feature based on local features extracted from respective points of the point cloud with the class identification feature, and convoluting the integration result, and outputs the shape completion point cloud that is to complete the input point cloud.Type: ApplicationFiled: September 12, 2019Publication date: January 6, 2022Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Hidehisa NAGANO, Go IRIE, Seiya ITO, Kazuhiko SUMI
-
Publication number: 20210303629Abstract: Clustering can be performed using a self-expression matrix in which noise is suppressed. A self-expression matrix is calculated that minimizes an objective function that is for obtaining, from among matrices included in a predetermined matrix set, a self-expression matrix whose elements are linear weights when data points in a data set are expressed by linear combinations of points, the objective function being represented by a term for obtaining the residual between data points in the data set and data points expressed by linear combinations of points using the self-expression matrix, a first regularization term that is multiplied by a predetermined weight and is for reducing linear weights of the data points that have a large Euclidean norm in the self-expression matrix, and a second regularization term for the self-expression matrix. A similarity matrix defined by the calculated self-expression matrix is then calculated.Type: ApplicationFiled: July 26, 2019Publication date: September 30, 2021Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Masataka YAMAGUCHI, Go IRIE, Kaoru HIRAMATSU, Kunio KASHINO
-
Publication number: 20210295112Abstract: An image identification device can be trained to identify classes with high accuracy even in cases with a small number of learning images.Type: ApplicationFiled: July 17, 2019Publication date: September 23, 2021Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go IRIE, Yu MIZUTSUMI
-
Publication number: 20210271702Abstract: Low-dimensional feature values with which semantic factors of content are ascertained are generated from relevance between sets of two types of content. Based on a relation indicator indicating a pair of groups indicating which groups are related to first types of content groups among second types of content groups, an initial feature value extracting unit 11 extracts initial feature values of the first type of content and the second type of content. A content pair selecting unit 12 selects a content pair by selecting one first type of content and one second type of content from each pair of groups indicated by the relation indicator. A feature value conversion function generating unit 13 generates feature conversion functions 31 of converting the initial feature values into low-dimensional feature values based on the content pair selected from each pair of groups.Type: ApplicationFiled: July 8, 2019Publication date: September 2, 2021Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go IRIE, Kaoru HIRAMATSU, Kunio KASHINO, Kiyoharu AIZAWA
-
Publication number: 20210256290Abstract: The purpose of the present invention is to enable learning of a neural network for extracting features of images having high robustness from an undiscriminating image region while minimizing the number of parameters of a pooling layer. A parameter learning unit 130 learns parameters of each layer in a convolutional neural network configured by including a fully convolutional layer for performing convolution of an input image to output a feature tensor of the input image, a weighting matrix estimation layer for estimating a weighting matrix indicating a weighting of each element of the feature tensor, and a pooling layer for extracting a feature vector of the input image based on the feature tensor and the weighting matrix.Type: ApplicationFiled: June 14, 2019Publication date: August 19, 2021Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Xiaomeng WU, Go IRIE, Kaoru HIRAMATSU, Kunio KASHINO
-
Patent number: 8386257Abstract: An audio feature is extracted from audio signal data for each analysis frame and stored in a storage part. Then, the audio feature is read from the storage part, and an emotional state probability of the audio feature corresponding to an emotional state is calculated using one or more statistical models constructed based on previously input learning audio signal data. Then, based on the calculated emotional state probability, the emotional state of a section including the analysis frame is determined.Type: GrantFiled: September 13, 2007Date of Patent: February 26, 2013Assignee: Nippon Telegraph and Telephone CorporationInventors: Go Irie, Kota Hidaka, Takashi Satou, Yukinobu Taniguchi, Shinya Nakajima
-
Publication number: 20090265170Abstract: An audio feature is extracted from audio signal data for each analysis frame and stored in a storage part. Then, the audio feature is read from the storage part, and an emotional state probability of the audio feature corresponding to an emotional state is calculated using one or more statistical models constructed based on previously input learning audio signal data. Then, based on the calculated emotional state probability, the emotional state of a section including the analysis frame is determined.Type: ApplicationFiled: September 13, 2007Publication date: October 22, 2009Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Go Irie, Kouta Hidaka, Takashi Satou, Yukinobu Taniguchi, Shinya Nakajima