Patents by Inventor Jinwei Feng

Jinwei Feng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Loudspeaker arrangement with on-screen voice positioning for telepresence system

Patent number: 9924252

Abstract: A videoconferencing system has a plurality of displays arranged side-by-side. Top loudspeakers are arranged adjacent the tops of the displays, and bottom loudspeakers are arranged adjacent the bottoms of the displays. A control unit operatively coupled to the displays and the loudspeakers routes video to each of the displays and routes audio corresponding to each display to any of the top and bottom loudspeakers arranged adjacent the display. Thus, the top and bottom loudspeakers form a vertical pair of loudspeakers that output the corresponding audio for its respective display. In this way, the audio for the video of a given display is perceived by participants to originate from the center of the given display. If one of the loudspeakers is not provided, gain setting and mixing between adjacent sets of loudspeakers can produce a virtual loudspeaker for the one that is missing.

Type: Grant

Filed: March 7, 2014

Date of Patent: March 20, 2018

Assignee: Polycom, Inc.

Inventors: Michael A. Pocino, Kwan K. Truong, Jinwei Feng, James M. Sharp
SYSTEM AND METHOD FOR LOCALIZING A TALKER USING AUDIO AND VIDEO INFORMATION

Publication number: 20180070053

Abstract: A videoconferencing endpoint includes at least one processor a number of microphones and at least one camera. The endpoint can receive audio information and visual motion information during a teleconferencing session. The audio information includes one or more angles with respect to the microphone from a location of a teleconferencing session. The system evaluates the audio information is evaluated to determine at least one candidate angle corresponding to a possible location of an active talker. The candidate angle can be analyzed further with respect to the motion information to determine whether the candidate angle correctly corresponds to person who is speaking during the teleconferencing session. The person's face can then be framed within a frame view.

Type: Application

Filed: November 9, 2017

Publication date: March 8, 2018

Inventor: Jinwei Feng
System and method for localizing a talker using audio and video information

Patent number: 9912908

Abstract: A videoconferencing endpoint includes at least one processor a number of microphones and at least one camera. The endpoint can receive audio information and visual motion information during a teleconferencing session. The audio information includes one or more angles with respect to the microphone from a location of a teleconferencing session. The audio information is evaluated automatically to determine at least one candidate angle corresponding to a possible location of an active talker. The candidate angle can be analyzed further with respect to the motion information to determine whether the candidate angle correctly corresponds to person who is speaking during the teleconferencing session.

Type: Grant

Filed: December 5, 2016

Date of Patent: March 6, 2018

Assignee: Polycom, Inc.

Inventor: Jinwei Feng
Voice tracking camera with speaker identification

Patent number: 9723260

Abstract: A videoconferencing apparatus automatically tracks speakers in a room and dynamically switches between a controlled, people-view camera and a fixed, room-view camera. When no one is speaking, the apparatus shows the room view to the far-end. When there is a dominant speaker in the room, the apparatus directs the people-view camera at the dominant speaker and switches from the room-view camera to the people-view camera. When there is a new speaker in the room, the apparatus switches to the room-view camera first, directs the people-view camera at the new speaker, and then switches to the people-view camera directed at the new speaker. When there are two near-end speakers engaged in a conversation, the apparatus tracks and zooms-in the people-view camera so that both speakers are in view.

Type: Grant

Filed: May 18, 2010

Date of Patent: August 1, 2017

Assignee: Polycom, Inc.

Inventor: Jinwei Feng
SYSTEM AND METHOD FOR LOCALIZING A TALKER USING AUDIO AND VIDEO INFORMATION

Publication number: 20170085837

Abstract: A videoconferencing endpoint includes at least one processor a number of microphones and at least one camera. The endpoint can receive audio information and visual motion information during a teleconferencing session. The audio information includes one or more angles with respect to the microphone from a location of a teleconferencing session. The audio information is evaluated automatically to determine at least one candidate angle corresponding to a possible location of an active talker. The candidate angle can be analyzed further with respect to the motion information to determine whether the candidate angle correctly corresponds to person who is speaking during the teleconferencing session.

Type: Application

Filed: December 5, 2016

Publication date: March 23, 2017

Inventor: Jinwei Feng
System and method for localizing a talker using audio and video information

Patent number: 9542603

Abstract: A videoconferencing endpoint includes at least one processor a number of microphones and at least one camera. The endpoint can receive audio information and visual motion information during a teleconferencing session. The audio information includes one or more angles with respect to the microphone from a location of a teleconferencing session. The audio information is evaluated automatically to determine at least one candidate angle corresponding to a possible location of an active talker. The candidate angle can be analyzed further with respect to the motion information to determine whether the candidate angle correctly corresponds to person who is speaking during the teleconferencing session.

Type: Grant

Filed: November 17, 2015

Date of Patent: January 10, 2017

Assignee: Polycom, Inc.

Inventor: Jinwei Feng
Videoconferencing endpoint having multiple voice-tracking cameras

Patent number: 9392221

Abstract: A videoconferencing apparatus automatically tracks speakers in a room and dynamically switches between a controlled, people-view camera and a fixed, room-view camera. When no one is speaking, the apparatus shows the room view to the far-end. When there is a dominant speaker in the room, the apparatus directs the people-view camera at the dominant speaker and switches from the room-view camera to the people-view camera. When there is a new speaker in the room, the apparatus switches to the room-view camera first, directs the people-view camera at the new speaker, and then switches to the people-view camera directed at the new speaker. When there are two near-end speakers engaged in a conversation, the apparatus tracks and zooms-in the people-view camera so that both speakers are in view.

Type: Grant

Filed: March 5, 2013

Date of Patent: July 12, 2016

Assignee: Polycom, Inc.

Inventors: Jinwei Feng, Peter Chu, Wayne Dunlap, Jonathan Gallmeier
SYSTEM AND METHOD FOR LOCALIZING A TALKER USING AUDIO AND VIDEO INFORMATION

Publication number: 20160140396

Abstract: A videoconferencing endpoint includes at least one processor a number of microphones and at least one camera. The endpoint can receive audio information and visual motion information during a teleconferencing session. The audio information includes one or more angles with respect to the microphone from a location of a teleconferencing session. The audio information is evaluated automatically to determine at least one candidate angle corresponding to a possible location of an active talker. The candidate angle can be analyzed further with respect to the motion information to determine whether the candidate angle correctly corresponds to person who is speaking during the teleconferencing session.

Type: Application

Filed: November 17, 2015

Publication date: May 19, 2016

Inventor: Jinwei Feng
Automatic microphone muting of undesired noises by microphone arrays

Patent number: 9282405

Abstract: Methods and systems for cancellation of table noise in a speaker system used for video or audio conferencing are disclosed. Table noise is cancelled by using a vertical microphone array to distinguish the tilt angle of sound received by a microphone. If the sound is close to horizontal, the audio is muted. If the sound is above a given angle from horizontal, it is not muted, as this indicates a person speaking. This eliminates paper rustling, keyboard clicks and the like.

Type: Grant

Filed: April 17, 2013

Date of Patent: March 8, 2016

Assignee: Polycom, Inc.

Inventors: Jinwei Feng, Peter L. Chu
Automatic camera selection for videoconferencing

Patent number: 9030520

Abstract: In videoconference camera selection, audio inputs associated with cameras for a videoconference are each processed into first and second audio energies respectively for first and second frequency ranges. The selection then determines which of the audio inputs has a greatest ratio of the first audio energy to the second audio energy and selects the associated camera view for outputting video for the videoconference. The selection can also process video inputs from the cameras either alone or in combination with the audio processing. Either way, the selection processes each of the video inputs for at least one facial characteristic and determines which of the video inputs has a greatest likelihood of framing a human face. In the end, the selection selects the associated camera view for outputting video for the videoconference based at least in part on this video-based determination.

Type: Grant

Filed: June 20, 2011

Date of Patent: May 12, 2015

Assignee: Polycom, Inc.

Inventors: Peter L. Chu, Jinwei Feng, Krishna Sai
Videoconferencing system having adjunct camera for auto-framing and tracking

Patent number: 8842161

Abstract: A videoconference apparatus and method coordinates a stationary view obtained with a stationary camera to an adjustable view obtained with an adjustable camera. The stationary camera can be a web camera, while the adjustable camera can be a pan-tilt-zoom camera. As the stationary camera obtains video, faces of participants are detected, and a boundary in the view is determined to contain the detected faces. Absence and presences of motion associated with the detected face is used to verify whether a face is reliable. To then capture and output video of the participants for the videoconference, the view of the adjustable camera is adjusted to a framed view based on the determined boundary. In the end, active video captured in the framed view with the adjustable camera can be sent to a far-end for the videoconference.

Type: Grant

Filed: August 20, 2012

Date of Patent: September 23, 2014

Assignee: Polycom, Inc.

Inventors: Jinwei Feng, Yibo Liu, Xiangdong Wang, Peter L. Chu
LOUDSPEAKER ARRANGEMENT WITH ON-SCREEN VOICE POSITIONING FOR TELEPRESENCE SYSTEM

Publication number: 20140270302

Abstract: A videoconferencing system has a plurality of displays arranged side-by-side. Top loudspeakers are arranged adjacent the tops of the displays, and bottom loudspeakers are arranged adjacent the bottoms of the displays. A control unit operatively coupled to the displays and the loudspeakers routes video to each of the displays and routes audio corresponding to each display to any of the top and bottom loudspeakers arranged adjacent the display. Thus, the top and bottom loudspeakers form a vertical pair of loudspeakers that output the corresponding audio for its respective display. In this way, the audio for the video of a given display is perceived by participants to originate from the center of the given display. If one of the loudspeakers is not provided, gain setting and mixing between adjacent sets of loudspeakers can produce a virtual loudspeaker for the one that is missing.

Type: Application

Filed: March 7, 2014

Publication date: September 18, 2014

Applicant: POLYCOM, INC.

Inventors: Michael A. Pocino, Kwan K. Truong, Jinwei Feng, James M. Sharp
Scalable audio in a multi-point environment

Patent number: 8831932

Abstract: Use of a scalable audio codec to implement distributed mixing and/or sender bit rate regulation in a multipoint conference is disclosed. The scalable audio codec allows the audio signal from each endpoint to be split into one or more frequency bands and for the transform coefficients within such bands to be prioritized such that usable audio may be decoded from a subset of the entire signal. The subset may be created by omitting certain frequency bands and/or by omitting certain coefficients within the frequency bands. By providing various rules for each endpoint in a conference, the endpoint can determine the importance of its signal to the conference and can select an appropriate bit rate, thereby conserving bandwidth and/or processing power throughout the conference.

Type: Grant

Filed: November 11, 2011

Date of Patent: September 9, 2014

Assignee: Polycom, Inc.

Inventors: Jinwei Feng, Peter L. Chu, Stephen Botzko
Videoconferencing System Having Adjunct Camera for Auto-Framing and Tracking

Publication number: 20140049595

Abstract: A videoconference apparatus and method coordinates a stationary view obtained with a stationary camera to an adjustable view obtained with an adjustable camera. The stationary camera can be a web camera, while the adjustable camera can be a pan-tilt-zoom camera. As the stationary camera obtains video, faces of participants are detected, and a boundary in the view is determined to contain the detected faces. Absence and presences of motion associated with the detected face is used to verify whether a face is reliable. To then capture and output video of the participants for the videoconference, the view of the adjustable camera is adjusted to a framed view based on the determined boundary. In the end, active video captured in the framed view with the adjustable camera can be sent to a far-end for the videoconference.

Type: Application

Filed: August 20, 2012

Publication date: February 20, 2014

Applicant: Polycom, Inc.

Inventors: Jinwei FENG, Yibo LIU, Xiangdong WANG, Peter L. CHU
AUTOMATIC MICROPHONE MUTING OF UNDESIRED NOISES BY MICROPHONE ARRAYS

Publication number: 20130294612

Abstract: Methods and systems for cancelation of table noise in a speaker system used for video or audio conferencing are disclosed. Table noise is cancelled by using a vertical microphone array to distinguish the tilt angle of sound received by a microphone. If the sound is close to horizontal, the audio is muted. If the sound is above a given angle from horizontal, it is not muted, as this indicates a person speaking. This eliminates paper rustling, keyboard clicks and the like.

Type: Application

Filed: April 17, 2013

Publication date: November 7, 2013

Inventors: Jinwei Feng, Peter L. Chu
Videoconferencing Endpoint Having Multiple Voice-Tracking Cameras

Publication number: 20130271559

Abstract: A videoconferencing apparatus automatically tracks speakers in a room and dynamically switches between a controlled, people-view camera and a fixed, room-view camera. When no one is speaking, the apparatus shows the room view to the far-end. When there is a dominant speaker in the room, the apparatus directs the people-view camera at the dominant speaker and switches from the room-view camera to the people-view camera. When there is a new speaker in the room, the apparatus switches to the room-view camera first, directs the people-view camera at the new speaker, and then switches to the people-view camera directed at the new speaker. When there are two near-end speakers engaged in a conversation, the apparatus tracks and zooms-in the people-view camera so that both speakers are in view.

Type: Application

Filed: March 5, 2013

Publication date: October 17, 2013

Inventors: Jinwei Feng, Peter Chu, Wayne Dunlap, Jonathan Gallmeier
Videoconferencing endpoint having multiple voice-tracking cameras

Patent number: 8395653

Abstract: A videoconferencing apparatus automatically tracks speakers in a room and dynamically switches between a controlled, people-view camera and a fixed, room-view camera. When no one is speaking, the apparatus shows the room view to the far-end. When there is a dominant speaker in the room, the apparatus directs the people-view camera at the dominant speaker and switches from the room-view camera to the people-view camera. When there is a new speaker in the room, the apparatus switches to the room-view camera first, directs the people-view camera at the new speaker, and then switches to the people-view camera directed at the new speaker. When there are two near-end speakers engaged in a conversation, the apparatus tracks and zooms-in the people-view camera so that both speakers are in view.

Type: Grant

Filed: May 18, 2010

Date of Patent: March 12, 2013

Assignee: Polycom, Inc.

Inventors: Jinwei Feng, Peter Chu, Wayne Dunlap, Jonathan Gallmeier
Full-band scalable audio codec

Patent number: 8386266

Abstract: A scalable audio codec for a processing device determines first and second bit allocations for each frame of input audio. First bits are allocated for a first frequency band, and second bits are allocated for a second frequency band. The allocations are made on a frame-by-frame basis based on the energy ratio between the two bands. For each frame, the codec transform codes both frequency bands into two sets of transform coefficients, which are then packetized based on the bit allocations. The packets are then transmitted with the processing device. Additionally, the frequency regions of the transform coefficients can be arranged in order of importance determined by power levels and perceptual modeling. Should bit stripping occur, the decoder at a receiving device can produce audio of suitable quality given that bits have been allocated between the bands and the regions of transform coefficients have been ordered by importance.

Type: Grant

Filed: July 1, 2010

Date of Patent: February 26, 2013

Assignee: Polycom, Inc.

Inventors: Jinwei Feng, Peter Chu
Automatic Camera Selection for Videoconferencing

Publication number: 20120320143

Abstract: In videoconference camera selection, audio inputs associated with cameras for a videoconference are each processed into first and second audio energies respectively for first and second frequency ranges. The selection then determines which of the audio inputs has a greatest ratio of the first audio energy to the second audio energy and selects the associated camera view for outputting video for the videoconference. The selection can also process video inputs from the cameras either alone or in combination with the audio processing. Either way, the selection processes each of the video inputs for at least one facial characteristic and determines which of the video inputs has a greatest likelihood of framing a human face. In the end, the selection selects the associated camera view for outputting video for the videoconference based at least in part on this video-based determination.

Type: Application

Filed: June 20, 2011

Publication date: December 20, 2012

Applicant: POLYCOM, INC.

Inventors: Peter L. CHU, Jinwei FENG, Krishna SAI
Scalable Audio in a Multi-Point Environment

Publication number: 20120290305

Abstract: Use of a scalable audio codec to implement distributed mixing and/or sender bit rate regulation in a multipoint conference is disclosed. The scalable audio codec allows the audio signal from each endpoint to be split into one or more frequency bands and for the transform coefficients within such bands to be prioritized such that usable audio may be decoded from a subset of the entire signal. The subset may be created by omitting certain frequency bands and/or by omitting certain coefficients within the frequency bands. By providing various rules for each endpoint in a conference, the endpoint can determine the importance of its signal to the conference and can select an appropriate bit rate, thereby conserving bandwidth and/or processing power throughout the conference.

Type: Application

Filed: November 11, 2011

Publication date: November 15, 2012

Applicant: POLYCOM, INC.

Inventors: Jinwei Feng, Peter L. Chu, Stephen Botzko

prev 1 2 3 next