WEB PAGE BASED VIDEO SERVICE AND APPARATUS
A telepresence server is for connection to a telecommunications network for providing access to a reality engine for a plurality of passive users. The reality engine can be controlled by an active user or a professional director through the network or by a local professional director. Active/passive mode displays can be used by the passive users in a passive mode and an active/passive mode display can be used by the active user in an active mode.
The present application is a continuation of U.S. patent application Ser. No. 09/177,356 filed Oct. 23, 1998 and which claims priority from U.S. Provisional Application Ser. No. 60/063,232 filed Oct. 23, 1997.
BACKGROUND OF THE INVENTION1. Field of the Invention
The invention relates to communication of images and, more particularly, to telepresence, including remote video monitoring.
2. Discussion of Related Art
Remote monitoring systems are known to include remotely located video cameras positioned for monitoring from a remote site with a personal computer or display. Such can be connected by any kind of connection such as point-to-point with a telephone line, via the internet or through an internet hub. A video server is used to capture successive real time images from a video camera, digitize and compress them and transfer them frame-by-frame through the internet, intranet or point-to-point protocol direct dial-in connection.
Telepresence is similar in concept to “virtual reality” except images and other stimuli are provided to the user via a connection in a telecommunications network. One approach uses a teleoperated camera platform coupled to the head movements of a remote user wearing a head-tracked, head-mounted display (HTHMD). See U.S. Pat. No. 5,436,638 at column 1, lines 43-48 and column 3, lines 10-31. Instead of a HTHMD, a desktop display can be yoked to the movements of a user seated before the display such as shown in FIGS. 13, 14A, 14B and 16 of U.S. Pat. No. 5,436,638. See also the PUSH desktop display and the BOOM3C head-coupled stereoscopic display, either hand-guided or hands-free (head-guided), of Fakespace, Inc., Menlo Park, Calif. Another approach is to use a remote reality engine with prerecorded scenarios for selection over the network according to monitored movements of the user. Due to the limited bandwidth typically available for such connections, the rate of frame delivery is very slow and therefore there is a noticeable lag between the time of image capture or retrieval and display. Moreover, the amount of video information conveyed is rather limited since the technology is based on the existing NTSC infrastructure. Consequently, the above described applications for telepresence tend to be lacking in the “presence” aspect and likewise remote viewing tends to be confined to rather static, e.g., industrial plant process monitoring, employee parking lot monitoring, security monitoring for plant ingress/egress and the like.
However, various competing transport technologies are now being deployed to increased the bandwidth enormously and thereby speed up such connections. These include optical fiber networks, cable, satellite and techniques to utilize the existing telephony infrastructure of twisted copper pairs as digital subscribers lines. Included in the services deliverable on the links provided according to such technologies will be HDTV. While the bandwidth of such links now being deployed to subscribers can be heavily proportioned in the downstream direction, they also provide at least a significant amount of upstream bandwidth. As a result, there will now be new opportunities for far more dynamic types of telepresence applications, including remote video monitoring, particularly on the Internet, and in ways heretofore never even contemplated. In particular, it can be foreseen that there will be extremely high demand for exciting, new telepresence application.
Unfortunately, these telepresence applications suffer from an underlying assumption borrowed from the art of “virtual reality” where the user is enabled to navigate within a virtual environment in a highly autonomous manner. The user takes command of the virtual environment and actively controls all of the responses of the reality engine according to monitored activity of the user. This dedication to a single user of the tools needed to generate the virtual environment makes the reality engine unavailable to all but this one user at a given time. A similar situation exists for a remotely located video camera. Since these tools are quite expensive, the cost of use for the single user is high. Hence the anticipated demand cannot be efficiently and economically met.
SUMMARY OF THE INVENTIONAs object of the present invention is to provide a new type of telepresence, including remote monitoring, that takes advantage of the increased bandwidth on links now being deployed.
Another object of the present invention is to provide telepresence to more than one user at a given time.
According to a first aspect of the present invention, a system for providing video images, comprises a video camera for providing video signals indicative of said video images captured by said video camera, a first display, responsive to said video signals, for providing said video images for viewing by a first user, an n-axis sensor, responsive to n-axis first display motions caused by said first user, for providing an n-axis attitude control signal, an n-axis platform having said video camera mounted thereon, responsive to said n-axis attitude command signal, for executing n-axis platform motions emulative of said n-axis first display motions, and one or more second displays, responsive to said video signals, for providing said video images for viewing by one or more corresponding second users and responsive to said n-axis attitude command signal for executing n-axis second display motions emulative of said n-axis first display motions.
According to a second aspect of the present invention, a system comprises at least one reality engine for providing an image signal indicative of images taken from various attitudes, and a telepresence server, responsive to said image signal, for providing said image signal and an attitude control signal to at least one attitudinally actuatable display via a telecommunications network for attitudinally actuating said display for guiding a viewing attitude of a user and for displaying said images for said user of said at least one attitudinally actuatable display for passively viewing said images from said various attitudes. The telepresence server can be for providing access to said reality engine for an active user of a display attitudinally actuatable by said active user for providing said attitude control signal to said reality engine and to said telepresence server wherein the user is drawn from the general public with no special training. Or, the telepresence server can be for providing access to said reality engine for a trained director who can be local, not needing network access to the server, or remote, needing to access via a network.
According to a third aspect of the present invention, a display device comprises an n-axis display platform, responsive in a passive mode to an attitudinal control signal, for guiding a user's head to execute attitudinal movements, and responsive in an active mode to attitudinal movements of a user's head for providing sensed signals indicative of said attitudinal movements, and a display connected to said n-axis display platform, responsive to a video signal, for displaying images corresponding to said attitudinal movements.
These and other objects, features and advantages of the present invention will become more apparent in light of the following detailed description of a best mode embodiment thereof, as illustrated in the accompanying drawing.
BRIEF DESCRIPTION OF THE DRAWING
Given the typical bandwidth limitations of existing methods, such as methods for accessing the internet and other similar connections, this way of remote video monitoring has been found to be effective for rather static type applications such as security monitoring. E.g., a security officer sits before a PC or other display (or bank or displays) and monitors the desired points in various plants of a company from a single remote monitoring site. For this sort of application, a need for a large amount of bandwidth is not particularly important and hence the proven success of such relatively static applications. On the other hand, more dynamic remote video monitoring applications, such as entertainment or education, cannot be expected to be viable using such limited bandwidth connections.
Telepresence concepts are shown implemented in
Considering the enormously increased bandwidth provided by the WISP 42 e.g., 7 or 8 Mbit/sec compared to 33 kbit/sec for the modem 16 of
Various head mounted displays are known. One type is a see-through display where the real world view of the user is “augmented” with imagery from an image source, called “augmented reality”. Another type completely blocks light from the outside and is for use in a completely virtual environment. Yet another type is a “video see-through” where the user wears stereo cameras on his head which provide images for perception of the surroundings using a head mounted display. All of these types of HMDs can be used to implement the present invention. However, many of these displays use bulky optics and related heavy cables which are somewhat burdensome. Moreover, presently available optics have a rather narrow field of view and present video image resolution is rather poor.
A particularly attractive recent innovation for the purposes of the present invention is the retinal display which does away with the external display and the associated optics entirely. There is no comparable problem with narrow field of view and low resolution with a retinal display. A retinal display has been disclosed for providing a scanning light signal for the formation of images firstly and directly in the eye of a viewer: U.S. Pat. No. 5,467,104 shows the projection of a modulated scanning light signal directly onto the retina of the viewer's eye without the prior formation of any real or aerial image outside the viewer's eye. In other words, light rays do not converge in any way outside the eye to form an image. That patent shows modulated photons of the light signal reflected from one or more scanners by way of projection optics directly onto the retina. A micromechanical scanner can be used as the scanning device, as shown in U.S. Pat. No. 5,557,444 (based on U.S. patent application Ser. No. 08/329,508, filed Oct. 26, 1994). An optical fiber may be used to provide the light signal from the photon source to the scanner as shown in U.S. Pat. No. 5,596,339 in order to promote a lightweight, head mounted, panoramic display.
In addition to the HMDs 56, 58, 60, a respective plurality of attitude sensors 62, 64, 66 are shown for mounting on the head of the user for sensing the rotational movements of the user's head and providing a sensed signal on a line 68, 70, 72, respectively, to interfaces 50, 52, 54 for upstream transmission. Such a device for determining orientation of a user's head using accelerometers is shown in U.S. Pat. No. 5,615,132 to Horton et al. Another is shown in U.S. Pat. No. 5,645,077 to Foxlin. Yet another is provided by Precision Navigation, Inc., 1235 Pear Avenue, Suite 111, Mountain View, Calif. 94043. For a simple case, it is assumed that translatory position (translation) of the user's head is not measured or, if measured, is ignored. A further simplification reduces the number of rotational degrees of freedom that are measured from three to two (e.g., pan (yaw) and tilt (pitch) as described below), or even just one. This simplification does exclude the measurement of translations, however. The WISP 42 is connected by a signal on a line 74 and via the internet 76 and a signal on a line 78 to another WISP 80 connected in turn to a plurality of video servers 82, 84 86 signals on lines 87, 88, 89. It should be realized that there need not be two separate WISP's 42, 80, but that in certain circumstances one can suffice. The video servers are connected to a corresponding plurality of cameras 90, 91, 92 by a plurality of signal lines 94, 96, 98. The cameras 90, 91, 92 send video signals via the internet 76 to the HMDs 56, 58, 60, respectively, for display.
In the opposite direction, the interfaces 50, 52, 54 transmit attitude command signals in response to the corresponding sensed attitude signals on the lines 68, 70,72 from the attitude sensors 62, 64, 66 through the WISP 42, the internet 76, the WISP 80 and the plurality of video servers 82, 84, 86 to a corresponding plurality of n-axis platforms such as three axis platforms 100, 102, 104.
The platforms 100, 102, 104 need not be three-axis, i. e., including pitch, roll and yaw but may be restricted to only two axes (e. g., pitch and yaw) or even just one (e. g., yaw). For instance, if roll is omitted, a 2-axis platform in the form of a computer controlled pan-tilt (2-axis: yaw-pitch) unit, Model PTU-46-70 or PTU-46-17.5, produced by Directed Perception, Inc., 1485 Rollins Road, Burlingame, Calif. 94010 may be used. Actuators from other manufacturers such as Densitron may be used as well. In addition to one or more of the three attitudinal degrees of freedom, one or more of the three translational degrees of freedom may also be added in any desired combination. For example, a six degree of freedom platform could be provided.
While some of the attitudinal or positional degrees of freedom discussed above may be added or subtracted in a given application in different combinations, it should be realized that other degrees of freedom that are different in kind from those discussed above may also be added to an n-axis platform. For instance, the attitude sensor 62, as shown in
Thus various combinations of monitoring of degrees-of-freedom of body parts can be used. Not only selected head and/or eye attitudinal degrees-of-freedom but also translatory (positional) degrees-of-freedom of the head can be monitored in one or more axes. These are altogether then emulated on the n-axis platform. Depending on the number of body parts and spatial motions thereof monitored, any correspondingly appropriate multi-axis positioning platform can be used. A platform based on those used for conventional flight-simulators but scaled down for a camera-sized application can be used. For instance, an even more scaled down version of the six degree of freedom principle demonstrated by the Polytec PI “Hexapod” can be used (Polytec PI, Inc., Suite 212, 23 Midstate Drive, Auburn, Mass. 01501 USA, the subsidiary of Physik Instruments (PI) GmbH & Co., and Polytec GmbH, both of Polytec-Platz 5-7, 76337 Waldbronn, Germany).
It will now be more fully realized from the foregoing, as mentioned above, that there will now be new opportunities for far more dynamic types of telepresence applications, including remote video monitoring, particularly on the Internet, and in ways heretofore never even contemplated. In particular, it can be foreseen that there will be extremely high demand for exciting, new telepresence applications.
As also mentioned above, these telepresence applications suffer from an underlying assumption borrowed from the art of “virtual reality” where the user is enabled to navigate within a virtual environment in a highly autonomous manner. The user takes command of the virtual environment and actively controls all of the responses of the reality engine according to monitored activity of the user. This has been shown extended to a wideband network in
According to the present invention, the remote monitoring carried out under the control of a remote active viewer using an HMD/attitude sensor 56, 62 combination, such as in
For instance,
As shown in further detail in
Considering the foregoing, the systems of
A conventional display 128 responsive to a signal on a line 130 from an interface 132 can be used instead of the HMD 56 or the device such as shown in U.S. Pat. No. 5,436,638. An attitude sensor or a conventional input device such as a mouse, joystick or the like 134 can be used, or a sensor such as shown in U.S. Pat. No. 5,436,638, to provide an upstream control signal on a line 136 to the interface 132. The interface 132 interchanges a bidirectional signal on a media line 138 with a wideband internet service provider 140 connected to the internet 76 by a line 142.
The wideband internet service provider 80 could own and operate the remotely located cameras and provide internet access to the various active viewers of
It should be realized that the displays need not be the versatile active/passive displays described here. The displays 150, 160, 162 can be designed to be useable purely as active displays such as the display shown in U.S. Pat. No. 5,436,638 to Bolas et al. Likewise, the displays 152, 154, 156, . . . , 158 can be designed to be useable purely as passive displays such as the various displays shown in co-pending U.S. patent application Ser. No. 08/794,122 filed Feb. 3, 1997, now U.S. Pat. No. 6,181,371, or even the simple conventional monitor 128 of
It should also be realized that the selectable mode (active/passive) display does not have to include a detachable helmet mounted display for use when the active mode is selected. For instance,
On the other hand, the user can instead use the mouse 218 to select one of the more popular sites that is already under active control indicated by “(now active)” such as Niagara Falls. In that case, the telepresence server 146 and reality engine 148 are responsive to the already active user's actions for causing images to be gathered from attitudes dictated by the active user and for providing the gathered images and the sensed yaw, pitch and roll signals to the device 163 for use in a passive way. In other words, the communications network 144 provides the gathered images and sensed yaw, pitch and roll signals from the device 150 used in an active way and provides them on the line 232 to the modem 230 which in turn provides them to the processor 210 for display on the display 164 and for controlling the yaw, pitch and roll motors by control signals on lines 234, 236, 238 for controlling the device 163 and hence the attitude of the display 164. In this way, a camera and associated platform at a popular site can be used by more than one user although only one is active. The same principle applies to accessing any kind of popular reality engine (such as preprogrammed “virtual reality” scenarios) which might otherwise be inaccessible because of high demand.
Although the invention has been shown and described with respect to a best mode embodiment thereof, it should be understood by those skilled in the art that the foregoing and various other changes, omissions and additions in the form and detail thereof may be made therein without departing from the spirit and scope of the invention.
Claims
1. Method, comprising:
- providing a web page showing a plurality of available videos from a corresponding plurality of video cameras of various owners, said cameras for acquiring said videos and made available by said owners over a telecommunications network for selection by a plurality of users using said web page on a corresponding plurality of displays in different geographic locations,
- providing videos selected by said plurality of users via said telecommunications network to said plurality of users of said corresponding plurality of displays according to selection signals received over said network from said plurality of users wherein each selection signal is indicative of a particular video selected by a particular user of said web page and wherein each of said plurality of available videos is selectable for use in viewing at a same time by more than one user of said plurality of users of said web page at said different geographic locations, and
- storing one or more of said plurality of available videos for display on said web page as stored videos for selection by said plurality of users so that video selection for viewing is delayed in time after video acquisition by said cameras to any extent.
2. The method of claim 1, wherein at least one user of said plurality of users is able to use a selected live video as an active user, that is, by providing a user control signal for controlling said live video and, alternatively, to use said live video as a passive user, that is, without providing any user control signal for controlling said live video.
3. The method of claim 1, wherein at least one of said one or more user control signals is provided over said network by an active user for actively controlling a corresponding live video.
4. The method of claim 3, wherein said active user is also able to use a selected live video as a passive user, that is, without providing any user control signal for controlling said live video.
5. The method of claim 3, wherein multiple users selecting said live video actively controlled by said active user only provide selection signals and are not providing any control signals and are therefore passive users of said live video.
6. The method of claim 1, wherein at least one of said one or more control signals is provided over said network by a remote director user for remotely controlling one or more corresponding live videos.
7. The method of claim 6, wherein multiple users selecting a remotely controlled one of said live videos actively controlled by said remote director user only provide selection signals and are not providing any control signals and are therefore only passive users of said remotely controlled live videos.
8. The method of claim 1, wherein at least one of said one or more control signals is provided by a local director user for locally controlling one or more corresponding live videos.
9. The method of claim 8, wherein multiple users selecting a locally controlled one of said live videos actively controlled by said local director user only provide selection signals and are not providing any control signals and are therefore only passive users of said one or more locally controlled live videos.
10. The method of claim 1, wherein said method is carried out by a server.
11. Apparatus, comprising:
- a memory for storing a plurality of videos received from a corresponding plurality of video cameras of various owners at different sites; and
- a server, for providing a web page for displaying a plurality of available videos on said web page indicative of said plurality of videos received from said corresponding plurality of video cameras, said server responsive to selection signals from a plurality of users in different geographic locations via a telecommunications network, for providing selected videos to said plurality of users via said telecommunications network according to said selection signals wherein each of said one or more videos is selectable by multiple users at a same time, wherein any one or more of said plurality of available videos from said plurality of video cameras are stored in said memory for display on said web page as stored videos for selection by said users so that video selection for viewing is delayed in time after video acquisition by said cameras to any extent.
12. The apparatus of claim 11, wherein a user is able to view a display of a particular selected video as a passive user, that is, without providing any active user control signal back to said server, while another user of said plurality of users provides said active user control signal for controlling said particular selected video and for viewing a display of said selected video as an active user.
13. The apparatus of claim 11, wherein at least one of one or more control signals is provided over said network by a remote director user for remotely controlling one or more corresponding videos.
14. The method of claim 11, wherein at least one of one or more control signals is provided by a local director user for locally controlling one or more corresponding videos.
15. The apparatus of claim 11, wherein an active user control signal can come from any one of said plurality of users any one of which is also able to use said video as a passive user, that is, without providing any user control signal for controlling said video while another user of said plurality of users provides said active user control signal for viewing a display of said selected video as an active user.
16. The apparatus of claim 11, wherein said server is responsive to an active user control signal from one user among said plurality of users for controlling said video actively while others of said plurality of users are without active control of said video but rather use the video passively, according to the control of said one user.
17. Apparatus, comprising:
- means for providing a web page showing a plurality of available videos from a corresponding plurality of video cameras of various owners, said cameras for acquiring said videos and made available by said owners over a telecommunications network for selection by a plurality of users using said web page on a corresponding plurality of displays in different geographic locations,
- means for providing videos selected by said plurality of users via said telecommunications network to said plurality of users of said corresponding plurality of displays according to selection signals received over said network from said plurality of users wherein each selection signal is indicative of a particular video selected by a particular user of said web page and wherein each of said plurality of available videos is selectable for use in viewing at a same time by more than one user of said plurality of users of said web page at said different geographic locations, and
- means for storing one or more of said plurality of available videos for display on said web page as stored videos for selection by said plurality of users so that video selection for viewing is delayed in time after video acquisition by said cameras to any extent.
18. The apparatus of claim 17, wherein at least one user of said plurality of users is able to use a selected live video as an active user, that is, by providing a user control signal for controlling said live video and, alternatively, to use said live video as a passive user, that is, without providing any user control signal for controlling said live video.
19. The apparatus of claim 17, wherein at least one of one or more control signals is provided over said network by a remote director user for remotely controlling one or more corresponding videos.
20. The method of claim 17, wherein at least one of one or more control signals is provided by a local director user for locally controlling one or more corresponding videos.
Type: Application
Filed: Mar 12, 2007
Publication Date: Jul 5, 2007
Inventor: Francis Maguire (Southbury, CT)
Application Number: 11/684,704
International Classification: H04N 7/16 (20060101); H04N 7/173 (20060101);