Patents by Inventor Zhengyou Zhang

Zhengyou Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10146305
    Abstract: The subject disclosure is generally directed towards eye gaze detection based upon both eyes being fixated on the same gaze location. In one aspect, an image including a subject's left and right eyes is captured, from which left and right glint and pupil center information are extracted. The left and right glint data and the left and right pupil data are used with left and right bias correction matrices to jointly determine a gaze location.
    Type: Grant
    Filed: December 22, 2016
    Date of Patent: December 4, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Zhengyou Zhang, Qin Cai
  • Patent number: 10134416
    Abstract: The privacy-preserving energy-efficient speaker implementations described herein improve user privacy while a user is listening to audio and can reduce the energy necessary to output the audio. This can be done by using parametric speakers and/or traditional loud-speakers. Signal splitting and masking can be used to improve user privacy. Additionally, a signal modulation technique which significantly reduces power requirements to output an audio signal, especially in the context of using parametric speakers, can also be employed.
    Type: Grant
    Filed: May 11, 2015
    Date of Patent: November 20, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Dinei Florencio, Zhengyou Zhang
  • Publication number: 20180332261
    Abstract: An apparatus that automatically monitors a display device includes a photo sensor configured to receive light from a display screen of the display device. The photo sensor provides signals representing detected light levels to a processor. The processor is coupled to the display device and is configured to cause the display device to present a test sequence including a plurality of images on the display screen. The processor is configured to capture data from the photo sensor during the presentation of the test sequence and to compare the captured data to an expected sequence corresponding to the test sequence displayed by a well-functioning display. The processor is further configured to report any mismatch between the captured data and the expected sequence as a possible malfunction of the display device.
    Type: Application
    Filed: May 9, 2017
    Publication date: November 15, 2018
    Inventors: Zhengyou Zhang, Zicheng Liu, Dinei Afonso Ferreira Florencio, Sasa Junuzovic
  • Publication number: 20180300623
    Abstract: A central server receives a venue identification query from a client device in the venue and a test data set including information collected from the venue. The central server then queries a classifier to identify the venue based on the test data. The classifier returns an identity value (venue ID) and a confidence value for the venue ID. When the confidence value is less than a threshold value, the central server obtains additional data from the client device until the venue is identified. The central server associates the venue ID with the test data set, including the additional data, and adds the test data set to training data for the classifier.
    Type: Application
    Filed: April 17, 2017
    Publication date: October 18, 2018
    Inventors: Zhengyou Zhang, Dinei Afonso Florencio, Sasa Junuzovic, Yinpeng Chen
  • Publication number: 20180255111
    Abstract: A method includes obtaining data representing multiple characters, determining a code for each character wherein each code corresponds to a different audio frequency, and transmitting the codes at the corresponding audio frequencies.
    Type: Application
    Filed: March 1, 2017
    Publication date: September 6, 2018
    Inventors: Zhengyou Zhang, Dinei Afonso Ferreira Florencio, Sasa Junuzovic
  • Publication number: 20180247132
    Abstract: Systems and methods for person counting are disclosed. A method may include retrieving an image frame from a plurality of image frames captured by a camera. The image frame may be split into a grid of a plurality of cells of pre-determined cell dimensions. The pre-determined cell dimensions may be based on dimensions of the retrieved image frame and reference dimensions of training images of a person detection classifier. At least a portion of the plurality of cells may be rearranged to generate a new image. The new image may be padded with at least one padding strip to adjust dimensions of the new image to the reference dimensions of the training images. Person detection may be performed using the new image and the person detection classifier to obtain a number of persons detected within the new image.
    Type: Application
    Filed: February 28, 2017
    Publication date: August 30, 2018
    Inventors: Zicheng Liu, Yinpeng Chen, Sean E Anderson, Zhengyou Zhang
  • Publication number: 20180232129
    Abstract: Techniques for recording and replay of a live conference while still attending the live conference are described. A conferencing system includes a user interface generator, a live conference processing module, and a replay processing module. The user interface generator is configured to generate a user interface that includes a replay control panel and one or more output panels. The live conference processing module is configured to extract information included in received conferencing data that is associated with one or more conferencing modalities, and to display the information in the one or more output panels in a live manner (e.g., as a live conference). The replay processing module is configured to enable information associated with the one or more conferencing modalities corresponding to a time of the conference session prior to live to be presented at a desired rate, possibly different from the real-time rate.
    Type: Application
    Filed: April 13, 2018
    Publication date: August 16, 2018
    Inventors: Kori Inkpen Quinn, Rajesh Hegde, Zhengyou Zhang, John Tang, Sasa Junuzovic, Christopher Brooks
  • Patent number: 10044982
    Abstract: A tele-immersive environment is described that provides interaction among participants of a tele-immersive session. The environment includes two or more set-ups, each associated with a participant. Each set-up, in turn, includes mirror functionality for presenting a three-dimensional virtual space for viewing by a local participant. The virtual space shows at least some of the participants as if the participants were physically present at a same location and looking into a mirror. The mirror functionality can be implemented as a combination of a semi-transparent mirror and a display device, or just a display device acting alone. According to another feature, the environment may present a virtual object in a manner that allows any of the participants of the tele-immersive session to interact with the virtual object.
    Type: Grant
    Filed: March 28, 2017
    Date of Patent: August 7, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Andrew D. Wilson, Zhengyou Zhang, Philip A. Chou, Neil S. Fishman, Donald M. Gillett, Hrvoje Benko
  • Patent number: 9990050
    Abstract: The subject disclosure is directed towards a technology by which dynamic hand gestures are recognized by processing depth data, including in real-time. In an offline stage, a classifier is trained from feature values extracted from frames of depth data that are associated with intended hand gestures. In an online stage, a feature extractor extracts feature values from sensed depth data that corresponds to an unknown hand gesture. These feature values are input to the classifier as a feature vector to receive a recognition result of the unknown hand gesture. The technology may be used in real time, and may be robust to variations in lighting, hand orientation, and the user's gesturing speed and style.
    Type: Grant
    Filed: October 25, 2016
    Date of Patent: June 5, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Zhengyou Zhang, Alexey Vladimirovich Kurakin
  • Patent number: 9977574
    Abstract: Techniques for recording and replay of a live conference while still attending the live conference are described. A conferencing system includes a user interface generator, a live conference processing module, and a replay processing module. The user interface generator is configured to generate a user interface that includes a replay control panel and one or more output panels. The live conference processing module is configured to extract information included in received conferencing data that is associated with one or more conferencing modalities, and to display the information in the one or more output panels in a live manner (e.g., as a live conference). The replay processing module is configured to enable information associated with the one or more conferencing modalities corresponding to a time of the conference session prior to live to be presented at a desired rate, possibly different from the real-time rate.
    Type: Grant
    Filed: July 9, 2014
    Date of Patent: May 22, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Kori Inkpen Quinn, Rajesh Hegde, Zhengyou Zhang, John Tang, Sasa Junuzovic, Christopher Brooks
  • Patent number: 9971490
    Abstract: The description relates to interactions with a display device. In one example, the interactions can include detecting a user proximate to a display and detecting a non-touch control gesture performed by the user proximate to the display. The example can also include presenting a graphical user interface (GUI) on the display that includes options associated with the control gesture. The example can also include receiving user input selecting one of the options and receiving additional user input from the user to interact with the GUI via the selected one of the options.
    Type: Grant
    Filed: January 14, 2015
    Date of Patent: May 15, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yinpeng Chen, Zicheng Liu, Zhengyou Zhang
  • Patent number: 9959627
    Abstract: A three-dimensional shape parameter computation system and method for computing three-dimensional human head shape parameters from two-dimensional facial feature points. A series of images containing a user's face is captured. Embodiments of the system and method deduce the 3D parameters of the user's head by examining a series of captured images of the user over time and in a variety of head poses and facial expressions, and then computing an average. An energy function is constructed over a batch of frames containing 2D face feature points obtained from the captured images, and the energy function is minimized to solve for the head shape parameters valid for the batch of frames. Head pose parameters and facial expression and animation parameters can vary over each captured image in the batch of frames. In some embodiments this minimization is performed using a modified Gauss-Newton minimization technique using a single iteration.
    Type: Grant
    Filed: May 6, 2015
    Date of Patent: May 1, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Nikolay Smolyanskiy, Christian F. Huitema, Cha Zhang, Lin Liang, Sean Eron Anderson, Zhengyou Zhang
  • Publication number: 20180089884
    Abstract: Implementations provide an in-person communication experience by generating a changeable virtual viewpoint for a participant in an online communication. For instance, techniques described herein capture visual data about participants in an online communication, and create a realistic geometric proxy from the visual data. A virtual scene geometry is generated that mimics an arrangement of an in-person meeting for the online communication. According to various implementations, a virtual viewpoint displays a changing virtual viewpoint, such as based on a change in position of a participant's face.
    Type: Application
    Filed: December 4, 2017
    Publication date: March 29, 2018
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Yancey Christopher Smith, Eric G. Lang, Zhengyou Zhang, Christian F. Huitema
  • Patent number: 9883138
    Abstract: The description relates to remote collaboration via a telepresence experience. One example can include an interactive digital display. The example can also include a virtual user presentation component configured to generate a graphical user interface that includes a virtual representation of a remote user on the interactive digital display. The graphical user interface can be configured to present the remote user in a side by side or mirror image relationship to a local user of the interactive digital display.
    Type: Grant
    Filed: July 16, 2014
    Date of Patent: January 30, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yinpeng Chen, Philip A. Chou, Zhengyou Zhang
  • Patent number: 9875533
    Abstract: Image processing for productivity applications is provided. An image may be received by a computing device. The computing device may detect the edges comprising the received image and adjust the image based on a skew state of the detected edges. The computing device may then process the adjusted image to correct imbalances. The computing device may then assign an image classification to the processed image. The computing device may then adjust the processed image based on the assigned image classification.
    Type: Grant
    Filed: January 18, 2017
    Date of Patent: January 23, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Seiichi Kato, Yasufumi Shiraishi, Daniel J. Moffitt, Kenji Nikaido, Daniel Eduardo Martinez, Lu Yuan, Jiangyu Liu, Aleksandar B. Tomic, Ivan Stojiljkovic, Zhengyou Zhang
  • Patent number: 9864430
    Abstract: Examples are disclosed herein that are related to gaze tracking via image data. One example provides, on a gaze tracking system comprising an image sensor, a method of determining a gaze direction, the method comprising acquiring image data via the image sensor, detecting in the image data facial features of a human subject, determining an eye rotation center based upon the facial features using a calibrated face model, determining an estimated position of a center of a lens of an eye from the image data, determining an optical axis based upon the eye rotation center and the estimated position of the center of the lens, determining a visual axis by applying an adjustment to the optical axis, determining the gaze direction based upon the visual axis, and providing an output based upon the gaze direction.
    Type: Grant
    Filed: January 9, 2015
    Date of Patent: January 9, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Dijia Wu, Michael J. Conrad, Tim Burrell, Xu Miao, Zicheng Liu, Qin Cai, Zhengyou Zhang
  • Patent number: 9836870
    Abstract: A perspective-correct communication window system and method for communicating between participants in an online meeting, where the participants are not in the same physical locations. Embodiments of the system and method provide an in-person communications experience by changing virtual viewpoint for the participants when they are viewing the online meeting. The participant sees a different perspective displayed on a monitor based on the location of the participant's eyes. Embodiments of the system and method include a capture and creation component that is used to capture visual data about each participant and create a realistic geometric proxy from the data. A scene geometry component is used to create a virtual scene geometry that mimics the arrangement of an in-person meeting. A virtual viewpoint component displays the changing virtual viewpoint to the viewer and can add perceived depth using motion parallax.
    Type: Grant
    Filed: April 13, 2016
    Date of Patent: December 5, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yancey Christopher Smith, Eric G. Lang, Zhengyou Zhang, Christian F. Huitema
  • Patent number: 9756284
    Abstract: The described implementations relate to enhancing images, such as in videoconferencing scenarios. One system includes a poriferous display screen having generally opposing front and back surfaces. This system also includes a camera positioned proximate to the back surface to capture an image through the poriferous display screen.
    Type: Grant
    Filed: August 3, 2015
    Date of Patent: September 5, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Cha Zhang, Timothy A. Large, Zhengyou Zhang, Ruigang Yang
  • Publication number: 20170251180
    Abstract: In embodiments of collaborative camera viewpoint control for interactive telepresence, a system includes a vehicle that travels based on received travel instructions, and the vehicle includes a camera system of multiple cameras that each capture video of an environment in which the vehicle travels from different viewpoints. Viewing devices receive the video of the environment from the different viewpoints, where the video of the environment from a selected one of the viewpoints is displayable to users of the viewing devices. Controller devices that are associated with the viewing devices can each receive a user input as a proposed travel instruction for the vehicle based on the selected viewpoint of the video that is displayed on the viewing devices. A trajectory planner receives the proposed travel instructions initiated via the controller devices, and generates a consensus travel instruction for the vehicle based on the proposed travel instructions.
    Type: Application
    Filed: February 29, 2016
    Publication date: August 31, 2017
    Inventors: Nikolai Smolyanskiy, Zhengyou Zhang, Vikram R. Dendi, Michael Hall
  • Publication number: 20170251179
    Abstract: A vehicle has a camera system that captures video while the vehicle moves. The vehicle records the captured video and/or wirelessly transmits the captured video to a remote user device for playback. When the vehicle is moving, a coarse waypoint is identified and a trajectory is determined from the current location of the vehicle to the coarse waypoint that reduces (e.g., minimizes) sudden changes in direction of movement of the vehicle, reduces (e.g., minimizes) sudden changes in speed of the vehicle, and/or reduces (e.g., minimizes) sudden changes in acceleration of the vehicle by reducing (e.g., minimizing) jerk or snap of the vehicle trajectory. One or more fine waypoints along the trajectory are selected and the vehicle moves to the coarse waypoint along the trajectory by passing through those fine waypoints, resulting in smooth movement of the device that reduces or eliminates motion sickness for users viewing the captured video.
    Type: Application
    Filed: February 29, 2016
    Publication date: August 31, 2017
    Inventors: Nikolai Smolyanskiy, Zhengyou Zhang, Vikram R. Dendi