Patents by Inventor Li-Wei He
Li-Wei He has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20080071920
Abstract: Media content is streamed from a server computer to a client computer. A media file format is used to store data for multiple timeline-altered streams that provides support for switching between the different timeline-altered streams during their presentation. In one aspect, a user can switch between different timeline-altered streams. Upon receiving a user request to switch to a particular timeline-altered stream (the target stream), the client computer accesses a time code stream data object corresponding to the current data unit being presented. The time code stream data object identifies a primary stream presentation time corresponding to the data unit. The client then uses the primary stream presentation time to index into a table of mappings to byte offsets. The table provides a mapping of the primary stream presentation time to a corresponding byte offset of the target stream.
Type: Application
Filed: November 26, 2007
Publication date: March 20, 2008
Applicant: Microsoft Corporation
Inventors: Anoop Gupta, Nosakhare Omoigui, Li-Wei He
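
The lookup described in this abstract can be illustrated with a minimal sketch. It is not taken from the patent: the table layout, the function name `byte_offset_for`, the sample values, and the choice to pick the latest entry at or before the requested time are all assumptions made for illustration.

```python
from bisect import bisect_right

def byte_offset_for(primary_time, index_table):
    """Return the target-stream byte offset for the latest index entry
    whose primary stream presentation time is <= primary_time.

    index_table is a hypothetical list of (primary_time_seconds, byte_offset)
    pairs sorted by time; the real file format is not specified here.
    """
    times = [t for t, _ in index_table]
    i = bisect_right(times, primary_time) - 1
    if i < 0:
        raise ValueError("primary_time precedes the first index entry")
    return index_table[i][1]

# Hypothetical index table for one timeline-altered target stream.
index_target = [(0.0, 0), (10.0, 48_000), (20.0, 97_500), (30.0, 145_200)]

# Hypothetical time code data object for the data unit being presented.
current_unit = {"primary_time": 23.4}

print(byte_offset_for(current_unit["primary_time"], index_target))  # 97500
```

A sorted table with binary search is only one way to realize "index into a table of mappings"; the abstract does not say how the table is organized.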
-
Patent number: 7337108
Abstract: An adaptive “temporal audio scaler” is provided for automatically stretching and compressing frames of audio signals received across a packet-based network. Prior to stretching or compressing segments of a current frame, the temporal audio scaler first computes a pitch period for each frame for sizing signal templates used for matching operations in stretching and compressing segments. Further, the temporal audio scaler also determines the type or types of segments comprising each frame. These segment types include “voiced” segments, “unvoiced” segments, and “mixed” segments which include both voiced and unvoiced portions. The stretching or compression methods applied to segments of each frame are then dependent upon the type of segments comprising each frame. Further, the amount of stretching and compression applied to particular segments is automatically variable for minimizing signal artifacts while still ensuring that an overall target stretching or compression ratio is maintained for each frame.
Type: Grant
Filed: September 10, 2003
Date of Patent: February 26, 2008
Assignee: Microsoft Corporation
Inventors: Dinei Florencio, Philip Chou, Li-Wei He
-
Patent number: 7313808
Abstract: A program distribution system includes a plurality of set-top boxes that receive broadcast programming and segmentation data from content and information providers. The segmentation information indicates portions of programs that are to be included in skimmed or condensed versions of the received programming, and is produced using manual or automated methods. Automated methods include the use of ancillary production data to detect the most important parts of a program. A user interface allows a user to control time scale modification and skimming during playback, and also allows the user to easily browse to different points within the current program.
Type: Grant
Filed: May 5, 2000
Date of Patent: December 25, 2007
Assignee: Microsoft Corporation
Inventors: Anoop Gupta, Li-Wei He, Francis C. Li, Yong Rui
-
Publication number: 20070286520
Abstract: Background blurring is an effective way to both preserve privacy and keep communication effective during video conferencing. The present image background blurring technique is a lightweight real-time technique to perform background blurring using a fast background modeling procedure combined with an object (e.g., face) detector/tracker. A soft decision is made at each pixel whether it belongs to the foreground or the background based on multiple vision features. The classification results are mapped to a per-pixel blurring radius image to blur the background. In another embodiment, the image background blurring technique blurs the background of the image without using the object detector.
Type: Application
Filed: June 7, 2006
Publication date: December 13, 2007
Applicant: Microsoft Corporation
Inventors: Cha Zhang, Li-wei He, Yong Rui
-
Patent number: 7301548
Abstract: This invention is directed toward a system and method for scanning a scene or object such as a whiteboard, paper document or similar item. More specifically, the invention is directed toward a system and method for obtaining a high-resolution image of a whiteboard or other object with a low-resolution camera. The system and method of the invention captures either a set of snapshots with overlap or a continuous video sequence, and then stitches them automatically into a single high-resolution image. The stitched image can finally be exported to other image processing systems and methods for further enhancement.
Type: Grant
Filed: December 11, 2004
Date of Patent: November 27, 2007
Assignee: Microsoft Corp.
Inventors: Zhengyou Zhang, Li-wei He
-
Patent number: 7302490
Abstract: In a networked client/server system, media content is streamed from a server computer to a client computer. A media file format is used to store data for multiple timeline-altered streams that provides support for switching between the different timeline-altered streams during their presentation. According to one embodiment, a time code stream includes multiple data objects mapping corresponding timeline-altered stream data units to primary stream presentation times, and an index table mapping primary stream presentation times to timeline-altered stream byte offsets.
Type: Grant
Filed: May 3, 2000
Date of Patent: November 27, 2007
Assignee: Microsoft Corporation
Inventors: Anoop Gupta, Nosakhare D. Omoigui, Li-Wei He
-
Publication number: 20070269105
Abstract: A system that captures both whiteboard content and audio signals of a meeting using a digital camera and a microphone. The system can be retrofit to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots. It also automatically produces a set of key frames representing all the written content on the whiteboard before each erasure. The whiteboard content serves as a visual index to efficiently browse the audio meeting. The system not only captures the whiteboard content, but also helps the users to view and manage the captured meeting content efficiently and securely.
Type: Application
Filed: August 31, 2006
Publication date: November 22, 2007
Applicant: Microsoft Corporation
Inventors: Zhengyou Zhang, Ross Cutler, Zicheng Liu, Anoop Gupta, Li-wei He
-
Patent number: 7293280
Abstract: A program distribution system includes a plurality of set-top boxes that receive broadcast programming and segmentation data from content and information providers. The segmentation information indicates portions of programs that are to be included in skimmed or condensed versions of the received programming, and is produced using manual or automated methods. Automated methods include the use of ancillary production data to detect the most important parts of a program. A user interface allows a user to control time scale modification and skimming during playback, and also allows the user to easily browse to different points within the current program.
Type: Grant
Filed: May 5, 2000
Date of Patent: November 6, 2007
Assignee: Microsoft Corporation
Inventors: Anoop Gupta, Li-Wei He, Francis C. Li, Yong Rui
-
Publication number: 20070237099
Abstract: A decentralized computer network architecture and method that gathers metadata from local and remote clients and, based on that metadata, locally makes a decision whether to send a packet over the network. Each client listens to what other clients are doing, and only sends when the total number of concurrent speakers is below some threshold. In a multi-party voice conferencing embodiment, the threshold is a number of concurrent speakers that is restricted to less than a certain number. Under the decentralized computer network architecture, the type of network topology used to connect the clients is flexible, as long as each client is running a peer-aware system to decide locally whether to send their packets. The decentralized computer network architecture and method is distributed to run on each client, making it suitable for a wide variety of network topologies (such as full-mesh, bridge-based, or a hybrid of the two).
Type: Application
Filed: March 29, 2006
Publication date: October 11, 2007
Applicant: Microsoft Corporation
Inventors: Li-wei He, Dinei Florencio, Yong Rui
-
Publication number: 20070230372
Abstract: A peer-aware voice stream ranking method that makes decisions based on information about participants of a voice conference over a network. Whether to send a participant's own audio packet out on the network is based both on information about the participant's own voice packet and voice packets that the participant receives from other clients. A Voice Activity Score (VAS) is computed for each frame of a particular voice stream. The VAS includes a voiceness component, indicating the likelihood that the audio frame contains speech or voice, and an energy level component indicating the ratio of current frame energy to the long-term average of energy for a current speaker. Using the VAS from the participants, the method also ranks the client's voice stream as compared to other clients' voice streams in the voice conference. If there are higher-ranking participants, the client's voice stream is not sent.
Type: Application
Filed: March 29, 2006
Publication date: October 4, 2007
Applicant: Microsoft Corporation
Inventors: Li-wei He, Dinei Florencio, Xun Xu
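
As a rough illustration of the ranking idea in this abstract, the sketch below computes a score per frame and decides locally whether to send. The abstract names the two components but not how they are combined, so the product used here is an assumption, as are the function names and the `max_speakers` cutoff.

```python
def voice_activity_score(voiceness, frame_energy, long_term_energy):
    """Combine the two components named in the abstract.

    voiceness: assumed likelihood in [0, 1] that the frame contains speech.
    The energy component is the ratio of current frame energy to the
    speaker's long-term average. Multiplying the two is an assumption.
    """
    ratio = frame_energy / long_term_energy if long_term_energy > 0 else 0.0
    return voiceness * ratio

def should_send(own_vas, peer_vas, max_speakers=2):
    """Send only if fewer than max_speakers peers rank above this client.

    max_speakers is a hypothetical cutoff; the abstract only says the
    stream is not sent when higher-ranking participants exist.
    """
    higher = sum(1 for v in peer_vas if v > own_vas)
    return higher < max_speakers

own = voice_activity_score(voiceness=0.9, frame_energy=4.0, long_term_energy=2.0)
print(should_send(own, peer_vas=[0.5, 2.5, 0.1]))  # True: one peer ranks higher
print(should_send(own, peer_vas=[2.0, 2.5]))       # False: two peers rank higher
```

Because every client runs the same comparison on the scores it hears, no central mixer is needed to choose the active speakers in this sketch.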
-
Patent number: 7260257
Abstract: A system that captures both whiteboard content and audio signals of a meeting using a digital camera and a microphone. The system can be retrofit to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots. It also automatically produces a set of key frames representing all the written content on the whiteboard before each erasure. The whiteboard content serves as a visual index to efficiently browse the audio meeting. The system not only captures the whiteboard content, but also helps the users to view and manage the captured meeting content efficiently and securely.
Type: Grant
Filed: June 19, 2002
Date of Patent: August 21, 2007
Assignee: Microsoft Corp.
Inventors: Zhengyou Zhang, Ross Cutler, Zicheng Liu, Anoop Gupta, Li-wei He
-
Patent number: 7243140
Abstract: A skimmed or preview version of multimedia content is provided to a client computer by a server computer using playlists. The skimmed version of multimedia content can be presented to a user of a client computer in less time than presenting the entire multimedia content would require. The server computer maintains skimming information that identifies particular segments of the multimedia content corresponding to the skimmed version. The server computer uses the skimming information to generate a playlist, which in turn is used by the server computer to access the appropriate segments of the multimedia content and provide the segments to the client computer.
Type: Grant
Filed: February 28, 2006
Date of Patent: July 10, 2007
Assignee: Microsoft Corporation
Inventors: Anoop Gupta, Li-Wei He, Nosakhare D. Omoigui
-
Publication number: 20070126755
Abstract: A system that captures both whiteboard content and audio signals of a meeting using a digital camera and a microphone. The system can be retrofit to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots. It also automatically produces a set of key frames representing all the written content on the whiteboard before each erasure. The whiteboard content serves as a visual index to efficiently browse the audio meeting. The system not only captures the whiteboard content, but also helps the users to view and manage the captured meeting content efficiently and securely.
Type: Application
Filed: November 30, 2006
Publication date: June 7, 2007
Applicant: Microsoft Corporation
Inventors: Zhengyou Zhang, Ross Cutler, Zicheng Liu, Anoop Gupta, Li-wei He
-
Publication number: 20070120979
Abstract: A combined digital and mechanical tracking system and process for generating a video using a single digital video camera that tracks a person or object of interest moving in a scene is presented. This generally involves operating the camera at a higher resolution than is needed for the application, and cropping a sub-region out of the image captured that is output as the output video. The person or object being tracked is at least partially contained within the cropped sub-region. As the person or object moves within the field of view of the camera, the location of the cropped sub-region is also moved so as to keep the subject of interest within its boundaries. When the subject of interest moves to the boundary of the FOV of the camera, the camera is mechanically panned to keep the person or object inside its FOV.
Type: Application
Filed: November 21, 2005
Publication date: May 31, 2007
Applicant: Microsoft Corporation
Inventors: Cha Zhang, Li-wei He, Yong Rui
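
The interplay between digital cropping and mechanical panning in this abstract can be sketched in one dimension. This is a simplified illustration, not the patented process: the function name `update_crop`, the centering rule, and the `margin` parameter are assumptions.

```python
def update_crop(subject_x, crop_w, frame_w, margin=50):
    """Return (crop_left, pan) for one frame, horizontal axis only.

    crop_left: left edge of the cropped sub-region, chosen so the subject
               is centered where possible and the crop stays in the frame.
    pan: -1 / 0 / +1, a hypothetical request to pan the camera left, not at
         all, or right when the subject nears the edge of the field of view.
    """
    crop_left = subject_x - crop_w // 2
    crop_left = max(0, min(crop_left, frame_w - crop_w))  # clamp to frame

    pan = 0
    if subject_x < margin:
        pan = -1
    elif subject_x > frame_w - margin:
        pan = 1
    return crop_left, pan

# A 1920-pixel-wide capture with a 640-pixel-wide output window.
print(update_crop(subject_x=960, crop_w=640, frame_w=1920))   # (640, 0)
print(update_crop(subject_x=1900, crop_w=640, frame_w=1920))  # (1280, 1)
```

Moving the crop window handles small movements without any motor motion; the pan request only fires once the subject approaches the frame boundary.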
-
Patent number: 7224847
Abstract: A system and method for streaming whiteboard content to computing devices in a networked environment. The invention is an extension of whiteboard image generation technology to provide network-based collaboration of a target meeting. In one aspect, each networked client can receive audio content and whiteboard content (video images). In another aspect, each networked client can transmit audio content and annotation content which is displayed separately or generated on the whiteboard image. The streaming content is built on external collaboration frameworks.
Type: Grant
Filed: June 17, 2003
Date of Patent: May 29, 2007
Assignee: Microsoft Corp.
Inventors: Zhengyou Zhang, Li-wei He
-
Patent number: 7171056
Abstract: A system and method of converting the content of a regular whiteboard into an electronic document. This system and method provides an effective, economical and easy to use way to convert whiteboard content as an enhanced and easily editable electronic document through the use of a still or video camera. It uses a series of image processing steps to clip borders, rectify the whiteboard image and correct colors.
Type: Grant
Filed: February 22, 2003
Date of Patent: January 30, 2007
Assignee: Microsoft Corp.
Inventors: Zhengyou Zhang, Li-wei He
-
Publication number: 20060277052
Abstract: A method and system for modifying a digital audio signal to vary its playback speed while preserving the signal's pitch and quality. The variable speed playback (VSP) system and method mitigates artifacts remaining after processing by existing techniques. The VSP system and method produces a consistent and pleasing sound to an audio file, even while its speed is varied during playback. The VSP method includes selecting and estimating an input frame, adjusting the frame position, and overlapping and adding the adjusted frame to an output signal. The frame position adjustment is achieved using an enhanced correlation technique that finds all local maxima over a cross-correlation function. The local maximum having the highest correlation score is designated as a cut position, where the adjusted frame is cut from the input buffer. The VSP system and method uses four input frames to generate one output frame.
Type: Application
Filed: June 1, 2005
Publication date: December 7, 2006
Applicant: Microsoft Corporation
Inventors: Li-wei He, Dinei Florencio
-
Publication number: 20060268101
Abstract: A method of digitally adding the appearance of makeup to a videoconferencing participant. The system and method for applying digital make-up operates in a loop processing sequential video frames. For each input frame, there are typically three general steps: 1) Locating the face and eye and mouth regions; 2) Applying digital make-up to the face, preferably with the exception of the eye and open mouth areas; and 3) Blending the make-up region with the rest of the face. In one embodiment of the invention, the background in the frame containing a video conferencing participant can also be modified so that other video conferencing participants cannot clearly see the background behind the participant in the image frame. In one such embodiment of the invention, the video conferencing participant tries to make his or her own image look comical or altered. In another embodiment of the invention, a particular remote participant tries to make another participant look funny to the other participants.
Type: Application
Filed: May 25, 2005
Publication date: November 30, 2006
Applicant: Microsoft Corporation
Inventors: Li-wei He, Michael Cohen, Yong Rui, Shinichi Manaka
-
Patent number: 7119816
Abstract: This invention is directed toward a system and method for scanning a scene or object such as a whiteboard, paper document or similar item. More specifically, the invention is directed toward a system and method for obtaining a high-resolution image of a whiteboard or other object with a low-resolution camera. The system and method of the invention captures either a set of snapshots with overlap or a continuous video sequence, and then stitches them automatically into a single high-resolution image. The stitched image can finally be exported to other image processing systems and methods for further enhancement.
Type: Grant
Filed: March 31, 2003
Date of Patent: October 10, 2006
Assignee: Microsoft Corp.
Inventors: Zhengyou Zhang, Li-wei He
-
Publication number: 20060182100
Abstract: The subject invention relates to systems and methods that facilitate direct network communications between peers that operate behind Network Address Translators. In one aspect, a network communications system is provided. The system includes one or more Network Address Translators (NAT) to communicate data across a network between peers. A protocol selection component automatically selects among a plurality of protocols according to one or more NAT types in order to determine a subset of the protocols that facilitate communications between the peers.
Type: Application
Filed: February 11, 2005
Publication date: August 17, 2006
Applicant: Microsoft Corporation
Inventors: Jin Li, Li-Wei He