Patents by Inventor Yongmian Zhang

Yongmian Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20190102653
    Abstract: A local connectivity feature transform (LCFT) is applied to binary document images containing text characters, to generate transformed document images which are then input into a bi-directional Long Short Term Memory (LSTM) neural network to perform character/word recognition. The LCFT transformed image is a gray scale image where the pixel values encode local pixel connectivity information of corresponding pixels in the original binary image. The transform is one that provides a unique transform score for every possible shape represented as a 3×3 block. In one example, the transform is computed using a 3×3 weight matrix that combines bit coding with a zigzag pattern to assign weights to each element of the 3×3 block, and by summing up the weights for the non-zero elements of the 3×3 block shape.
    Type: Application
    Filed: September 29, 2017
    Publication date: April 4, 2019
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Shubham Agarwal, Maral Mesmakhosroshahi, Yongmian Zhang
  • Publication number: 20190065817
    Abstract: An artificial neural network system implemented on a computer for cell segmentation and classification of biological images. It includes a deep convolutional neural network as a feature extraction network, a first branch network connected to the feature extraction network to perform cell segmentation, and a second branch network connected to the feature extraction network to perform cell classification using the cell segmentation map generated by the first branch network. The feature extraction network is a modified VGG network where each convolutional layer uses multiple kernels of different sizes. The second branch network takes feature maps from two levels of the feature extraction network, and has multiple fully connected layers to independently process multiple cropped patches of the feature maps, the cropped patches being located at a centered and multiple shifted positions relative to the cell being classified; a voting method is used to determine the final cell classification.
    Type: Application
    Filed: August 29, 2017
    Publication date: February 28, 2019
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Maral Mesmakhosroshahi, Shubham Agarwal, Yongmian Zhang
  • Patent number: 10002410
    Abstract: A method, computer readable medium, and system are disclosed of enhancing cell images for analysis. The method includes performing a multi-thresholding process on a cell image to generate a plurality of images of the cell image; smoothing each component within each of the plurality of images; merging the smoothed components into a merger layer; classifying each of the components of the merged layer into convex cell regions and concave cell regions; combining the concave cell regions with a cell boundary for each of the corresponding concave cell regions to generate a smoothed shape profile for each of the concave cell regions; and generating an output image by combining the convex cell regions with the concave cell regions with smoothed shape profiles.
    Type: Grant
    Filed: August 31, 2016
    Date of Patent: June 19, 2018
    Assignee: KONICA MINOLTA LABORATORY U.S.A, INC.
    Inventors: Jingwen Zhu, Yongmian Zhang, Foram Manish Paradkar, Haisong Gu
  • Patent number: 9953215
    Abstract: A method, system and non-transitory computer readable medium are disclosed for recognizing gestures, the method includes capturing at least one three-dimensional (3D) video stream of data on a subject; extracting a time-series of skeletal data from the at least one 3D video stream of data; isolating a plurality of points of abrupt content change called temporal cuts, the plurality of temporal cuts defining a set of non-overlapping adjacent segments partitioning the time-series of skeletal data; identifying among the plurality of temporal cuts, temporal cuts of the time-series of skeletal data having a positive acceleration; and classifying each of the one or more pair of consecutive cuts with the positive acceleration as a gesture boundary.
    Type: Grant
    Filed: August 27, 2015
    Date of Patent: April 24, 2018
    Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Quentin Auge, Yongmian Zhang, Haisong Gu
  • Patent number: 9922245
    Abstract: A method, a system, and a non-transitory computer readable medium for recognizing an object. The method includes emitting an array of infrared rays from an infrared emitter towards a projection region, the projection region including a first object; generating a reference infrared image by recording an intensity of ray reflection from the projection region without the first object; generating a target infrared image by recording the intensity of ray reflection from the projection region with the first object; comparing the target infrared image to the reference infrared image to generate a predetermined intensity threshold; and extracting the first object from the target infrared image, if the intensity of ray reflection of the target infrared image of the first object exceeds the predetermined intensity threshold.
    Type: Grant
    Filed: August 15, 2014
    Date of Patent: March 20, 2018
    Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Yongmian Zhang, Hung-Shou Tai, Haisong Gu
  • Patent number: 9848170
    Abstract: A method, computer readable storage medium, and system are disclosed for improving communication productivity in a conference between two or more subjects, wherein at least one of the two or more subjects participates in the conference from a first location and one or more of the two or more subjects participate in the meeting from a second location. The method includes capturing, at least one first three-dimensional (3D) stream of data and at least one second three-dimensional (3D) stream of data on each of the two or more subjects participating in the conference; generating a synchrony score for the two or more subjects, wherein the synchrony score is calculated by comparing time series of skeletal data of each of the two or more subjects to one another for a defined period of time; and using the synchrony score to generate an engagement index between the two or more subjects.
    Type: Grant
    Filed: March 28, 2017
    Date of Patent: December 19, 2017
    Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Foram Manish Paradkar, Yongmian Zhang, Haisong Gu
  • Patent number: 9823755
    Abstract: A method and system are disclosed for recognizing an object, the method including emitting one or more arranged patterns of infrared rays (IR) from an infrared emitter towards a projection region, the one or more arranged patterns of infrared rays forming unique dot patterns; mapping the one or more arranged patterns of infrared rays on the operation region to generate a reference image; capturing an IR image and a RGB image of an object with a wearable device, the wearable device including an infrared (IR) camera and a RGB camera; extracting IR dots from the IR image and determining a match between the extracted IR dots and the reference image; determining a position of the RGB image on the reference image; and mapping the position of the RGB image to a coordinate on the projection region.
    Type: Grant
    Filed: February 26, 2015
    Date of Patent: November 21, 2017
    Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Yongmian Zhang, Jingwen Zhu, Toshiki Ohinata, Haisong Gu
  • Publication number: 20170286760
    Abstract: A method, system and non-transitory computer readable medium are disclosed for recognizing gestures, the method includes capturing at least one three-dimensional (3D) video stream of data on a subject; extracting a time-series of skeletal data from the at least one 3D video stream of data; isolating a plurality of points of abrupt content change called temporal cuts, the plurality of temporal cuts defining a set of non-overlapping adjacent segments partitioning the time-series of skeletal data; identifying among the plurality of temporal cuts, temporal cuts of the time-series of skeletal data having a positive acceleration; and classifying each of the one or more pair of consecutive cuts with the positive acceleration as a gesture boundary.
    Type: Application
    Filed: August 27, 2015
    Publication date: October 5, 2017
    Applicant: Konica Minolta Laboratory U.S.A., Inc.
    Inventors: Quentin AUGE, Yongmian ZHANG, Haisong GU
  • Publication number: 20170201717
    Abstract: A method, computer readable storage medium, and system are disclosed for improving communication productivity in a conference between two or more subjects, wherein at least one of the two or more subjects participates in the conference from a first location and one or more of the two or more subjects participate in the meeting from a second location. The method includes capturing, at least one first three-dimensional (3D) stream of data and at least one second three-dimensional (3D) stream of data on each of the two or more subjects participating in the conference; generating a synchrony score for the two or more subjects, wherein the synchrony score is calculated by comparing time series of skeletal data of each of the two or more subjects to one another for a defined period of time; and using the synchrony score to generate an engagement index between the two or more subjects.
    Type: Application
    Filed: March 28, 2017
    Publication date: July 13, 2017
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Foram Manish PARADKAR, Yongmian ZHANG, Haisong GU
  • Patent number: 9639770
    Abstract: A method, computer readable storage medium, and system are disclosed for improving communication productivity, comprising: capturing at least one three-dimensional (3D) stream of data on two or more subjects; extracting a time-series of skeletal data from the at least one 3D stream of data on the two or more subjects; and determining an engagement index between the two or more subjects by comparing the time-series of skeletal data on each of the two or more subjects over a time window.
    Type: Grant
    Filed: March 26, 2015
    Date of Patent: May 2, 2017
    Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Foram Manish Paradkar, Yongmian Zhang, Haisong Gu
  • Publication number: 20170091907
    Abstract: A method, computer readable medium, and system are disclosed of enhancing cell images for analysis. The method includes performing a multi-thresholding process on a cell image to generate a plurality of images of the cell image; smoothing each component within each of the plurality of images; merging the smoothed components into a merger layer; classifying each of the components of the merged layer into convex cell regions and concave cell regions; combining the concave cell regions with a cell boundary for each of the corresponding concave cell regions to generate a smoothed shape profile for each of the concave cell regions; and generating an output image by combining the convex cell regions with the concave cell regions with smoothed shape profiles.
    Type: Application
    Filed: August 31, 2016
    Publication date: March 30, 2017
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Jingwen ZHU, Yongmian ZHANG, Foram Manish PARADKAR, Haisong GU
  • Publication number: 20170091948
    Abstract: A method, a computer readable medium, and a system are disclosed for cell segmentation. The method including generating a binary mask from an input image of a plurality of cells, wherein the binary mask separates foreground cells from a background; classifying each of the cell regions of the binary mask into single cell regions, small cluster regions, and large cluster regions; performing, on each of the small cluster regions, a segmentation based on a contour shape of the small cluster region; performing, on each of the large cluster regions, a segmentation based on a texture in the large cluster regions; and outputting an image with cell boundaries.
    Type: Application
    Filed: August 31, 2016
    Publication date: March 30, 2017
    Applicant: Konica Minolta Laboratory U.S.A., Inc.
    Inventors: Foram Manish PARADKAR, Yongmian ZHANG, Jingwen ZHU, Haisong GU
  • Patent number: 9489570
    Abstract: A method and system for recognizing behavior is disclosed, the method includes: capturing at least one video stream of data on one or more subjects; extracting body skeleton data from the at least one video stream of data; computing feature extractions on the extracted body skeleton data to generate a plurality of 3 dimensional delta units for each frame of the extracted body skeleton data; generating a plurality of histogram sequences for each frame by projecting the plurality of 3 dimensional delta units for each frame to a spherical coordinate system having a plurality of spherical bins; generating an energy map for each of the plurality of histogram sequences by mapping the plurality of spherical bins versus time; applying a Histogram of Oriented Gradients (HOG) algorithm on the plurality of energy maps to generate a single column vector; and classifying the single column vector as a behavior and/or emotion.
    Type: Grant
    Filed: December 31, 2013
    Date of Patent: November 8, 2016
    Assignee: Konica Minolta Laboratory U.S.A., Inc.
    Inventors: Chen Cao, Yongmian Zhang, Haisong Gu
  • Publication number: 20160283816
    Abstract: A method, computer readable storage medium, and system are disclosed for improving communication productivity, comprising: capturing at least one three-dimensional (3D) stream of data on two or more subjects; extracting a time-series of skeletal data from the at least one 3D stream of data on the two or more subjects; and determining an engagement index between the two or more subjects by comparing the time-series of skeletal data on each of the two or more subjects over a time window.
    Type: Application
    Filed: March 26, 2015
    Publication date: September 29, 2016
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Foram Manish PARADKAR, Yongmian ZHANG, Haisong GU
  • Publication number: 20160252976
    Abstract: A method and system are disclosed for recognizing an object, the method including emitting one or more arranged patterns of infrared rays (IR) from an infrared emitter towards a projection region, the one or more arranged patterns of infrared rays forming unique dot patterns; mapping the one or more arranged patterns of infrared rays on the operation region to generate a reference image; capturing an IR image and a RGB image of an object with a wearable device, the wearable device including an infrared (IR) camera and a RGB camera; extracting IR dots from the IR image and determining a match between the extracted IR dots and the reference image; determining a position of the RGB image on the reference image; and mapping the position of the RGB image to a coordinate on the projection region.
    Type: Application
    Filed: February 26, 2015
    Publication date: September 1, 2016
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Yongmian ZHANG, Jingwen ZHU, Toshiki OHINATA, Haisong GU
  • Patent number: 9355306
    Abstract: A method for recognizing abnormal behavior is disclosed, the method includes: capturing at least one video stream of data on one or more subjects; extracting body skeleton data from the at least one video stream of data; classifying the extracted body skeleton data as normal behavior or abnormal behavior; and generating an alert, if the extracted skeleton data is classified as abnormal behavior.
    Type: Grant
    Filed: September 27, 2013
    Date of Patent: May 31, 2016
    Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Dongdong Wu, Yongmian Zhang, Haisong Gu
  • Publication number: 20160078287
    Abstract: A method, system and non-transitory computer readable medium for recognizing gestures are disclosed, the method includes capturing at least one three-dimensional (3D) video stream of data on a subject; extracting a time-series of skeletal data from the at least one 3D video stream of data; isolating a plurality of points of abrupt content change called temporal cuts, the plurality of temporal cuts defining a set of non-overlapping adjacent segments partitioning the time-series of skeletal data; identifying among the plurality of temporal cuts, temporal cuts of the time-series of skeletal data having a positive acceleration; and classifying each of the one or more pair of consecutive cuts with the positive acceleration as a gesture boundary.
    Type: Application
    Filed: August 29, 2014
    Publication date: March 17, 2016
    Applicant: KONICA MINOLA LABORATORY U.S.A., INC.
    Inventors: Quentin AUGE, Yongmian Zhang, Haisong Gu
  • Publication number: 20160048727
    Abstract: A method, a system, and a non-transitory computer readable medium for recognizing an object are disclosed, the method including: emitting an array of infrared rays from an infrared emitter towards a projection region, the projection region including a first object; generating a reference infrared image by recording an intensity of ray reflection from the projection region without the first object; generating a target infrared image by recording the intensity of ray reflection from the projection region with the first object; comparing the target infrared image to the reference infrared image to generate a predetermined intensity threshold; and extracting the first object from the target infrared image, if the intensity of ray reflection of the target infrared image of the first object exceeds the predetermined intensity threshold.
    Type: Application
    Filed: August 15, 2014
    Publication date: February 18, 2016
    Applicant: Konica Minolta Laboratory U.S.A., Inc.
    Inventors: Yongmian ZHANG, Hung-Shou Tai, Haisong Gu
  • Publication number: 20150186713
    Abstract: A method and system for recognizing behavior is disclosed, the method includes: capturing at least one video stream of data on one or more subjects; extracting body skeleton data from the at least one video stream of data; computing feature extractions on the extracted body skeleton data to generate a plurality of 3 dimensional delta units for each frame of the extracted body skeleton data; generating a plurality of histogram sequences for each frame by projecting the plurality of 3 dimensional delta units for each frame to a spherical coordinate system having a plurality of spherical bins; generating an energy map for each of the plurality of histogram sequences by mapping the plurality of spherical bins versus time; applying a Histogram of Oriented Gradients (HOG) algorithm on the plurality of energy maps to generate a single column vector; and classifying the single column vector as a behavior and/or emotion.
    Type: Application
    Filed: December 31, 2013
    Publication date: July 2, 2015
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Chen CAO, Yongmian ZHANG, Haisong GU
  • Publication number: 20150092978
    Abstract: A method for recognizing abnormal behavior is disclosed, the method includes: capturing at least one video stream of data on one or more subjects; extracting body skeleton data from the at least one video stream of data; classifying the extracted body skeleton data as normal behavior or abnormal behavior; and generating an alert, if the extracted skeleton data is classified as abnormal behavior.
    Type: Application
    Filed: September 27, 2013
    Publication date: April 2, 2015
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: Dongdong WU, Yongmian Zhang, Haisong Gu