Computer user interface for audio and/or video auto-summarization
A relativity controller is a scroll bar/window combination that provides a way to see data in relation to both the context of its wholeness and the salience of its contents. To accomplish this, the linear density or other appearance of the scroll bar (acting as a ruler or scale) varies with the density of the document salience (as indicated by different kinds of annotations or marks). It also provides a way to zoom between perspectives. This is usable on many different data types: including sound, video, graphics, calendars and word processors.
This invention relates to a computer system, and in particular to computer tools to improve user perspectives and enhance navigation or browsing of information sources stored in or available via the computer.
BACKGROUND OF INVENTIONAs computer accessing of large quantities of information increases, the ability of users to navigate large information spaces and to maintain visualization or personal perspectives thereof decreases [1] (bracketed numbers reference publications identified in Appendix A).
The heed for this type of control has been expressed most recently by Furnas [2], Mills [3], Degen [4], and Chimera [5].
Furnas' solution to the problem of understanding the limited information available in a window of large information structures is to provide in the window the detailed region to be considered in the context of important preceding or succeeding parts of the large structure. For example, to edit lines in the middle of a program, the window would also display, say, declarations at the beginning of the program. No magnification of desired information or shrinkage of undesired information is employed; rather, the desired program information is normally displayed, and many parts of the program are omitted from the display.
Mills et al addressed the issue of giving users access to video data by magnifying time through successive hierarchial extraction of increasingly detailed segments. Each expanded segment view was displayed in a separate window of the display. And each segment view, as well as the total video view, including the time lines associated therewith, were linearly arranged from a temporal standpoint.
Degen et al moved marks on audio tape to a digitized counterpart document scroll bar, and let the user change the visual scaling of time within a single window, as well as the speed of playback. But, again, the visual representations, whether of the original size or of the zoomed expanded size, had a linear temporal structure.
Chimera, on the other hand, maintained a full display within the window but was unable to provide a zooming feature or expanded segment view of a text listing. Instead, Chimera used scroll bars that, independent of the original data's representation, indicate relative values of list attributes by respectively scaling proportions of list item indicators, according to those attributes, in the scroll bars.
Furnas shows in a single window multiple fisheye views of document segments. But Furnas doesn't disclose how a user can select which segments to display, or the means to magnify certain segments, or the means to control the degree of magnification, nor does Furnas provided a scroll bar or its equivalent as a convenient interface for the user to manipulate the display.
SUMMARY OF INVENTIONAn object of the invention is a computer system providing improved means to allow users to extract important segments of computer-displayed information in the form of video, sound, graphics or text while maintaining a general view of the information.
Another object of the invention is a computerized system and method to enable users better to navigate or visualize large information spaces.
In accordance with one aspect of the present invention, means are provided to enable a user to visibly mark points or segments of displayed information, which will enable the user to quickly navigate to the marked displays.
In another aspect of the invention, a scroll bar is displayed alongside the information display, and the visible mark or marks appears on the scroll bar at locations corresponding to the desired information.
In accordance with a further aspect of the present invention, a computerized system provides the user with means to shrink less important or less significant portions of the information displayed, with the result of magnifying the portions that the user deems significant. In accordance with this aspect, the invention can be viewed as a user-friendly relativity controller tool that enables users to specify what is important to them, and modify the portion of their perceptual space that that information takes up, in a fisheye variant.
In accordance with another aspect of the invention, the resultant information can still occupy the same window where originally displayed, but with certain segments shrunk and other segments in comparison standing out or becoming more prominent.
In accordance with still other aspects of the invention, the relativity controller of the invention is implemented by simply pointing to the screen and actuating a control device. In a preferred embodiment, a mouse button, is pressed to mark the beginning and end of segments of the information to be marked. A further feature is that multiple segments can be marked in this manner. Thus, the relativity controller of the invention not only allows users to mark the scope of one or more salient segments, but also will cause the display to simultaneously shrink the non-marked portions and in effect zoom into the multiple-marked segments in a single step. The result is a non-linear display of the available information. As a further feature, simultaneously with selective zooming of the information, the display of the scroll bar is correspondingly modified to show in the context of the total information the marked and non-marked portions of the displayed information.
The major benefits is to allow users to quickly navigate through a large information space and to control the salience of the displayed information in the context of the full display while conserving display area, sometime called desktop real estate. Moreover, maintaining a single window for the data and giving users the ability to visually navigate across the whole data via the scroll bar together with the ability to select the salient segments as well as the level of zoom, all in a single step, greatly enhances the ability of the user to cope intelligently and rapidly with large information structures containing large numbers of objects.
The above and further objects, details and advantages of the present invention will become apparent from the following detailed description of preferred embodiments thereof, when read in conjunction with the accompanying drawings.
SUMMARY OF DRAWINGSThe following terms used herein will have the following meanings.
“Object” means any representation of information or of a data structure that can be displayed on the monitor screen, and includes one or more text characters, one or more sound representations such as a digital sample, a video representation such as a video frame, and in general any graphic's element.
“Control device” means devices manipulated by users to move cursors around a screen, and include a mouse and keyboard.
“Pointing” to an object on screen means actuating the control device to move the cursor so that it is over or adjacent the object. When the cursor is a pointer such an arrow, it means moving the arrow tip close to the object.
“Clicking” on an object means to press and quickly release a switch on the control device, such as a button on a mouse, when the cursor is pointing to the object.
“Dragging” means to click on the object, and while holding the switch activated, to manipulate the control device to move the object to a new screen location, and then to release the switch to fix the new screen location of the object.
“Doubles-clicking” an object on screen is by pointing to the object and clicking twice rapidly, often used for special control purposes.
“Shrinking” the display of objects means reducing the time or space normally allocated to display the objects, and includes shrinking them to the point where they essentially disappear from the display.
A “scroll bar” is a common control device displayed alongside a window, having, typically, at opposite ends small arrowed scroll boxes or buttons that when clicked on by the user causes the window contents to scroll.
A “button” or “box” on a scroll bar is a representation of a control device for use with a mouse.
A “thumb” is a button or box on the scroll bar, between its ends, which moves and whose location on the scroll bar corresponds to the location in the whole information of the current view.
The first example concerns a sound representation. As illustrated in
In a usual GUI display, a horizontal title bar 37 is located on top and a vertical menu or tool bar 38 is displayed at the left side. Clicking on any of the icons displayed in the tool bar will invoke appropriate software routines to carry out the function indicated by the icon. In this particular example, the user desires to annotate the sound representation, and the icons can represent an EDIT function, or a DRAW function including certain graphic symbols to be pasted into the sound representation.
In accordance with an aspect of the present invention, the computer has been trained or customized to recognize meaningful objects and mark them. In this particular case, a meaningful object can be any sound representation above a certain amplitude, i.e., loud sounds, but the computer can choose instead certain frequencies or ranges or certain sound sequences. Marking means with respect to the data structure representing the object to add a tag bit or other data representing a marked time or space position or point. If it is desired to mark a segment, meaning a temporal sequence of objects, then one tag data can represent the beginning of the marked segment, and another data bit can represent the end of the marked segment.
In accordance with another aspect of the present invention, the mark is displayed on the display. In
In accordance with another aspect of the invention, the scroll bar temporal representation is modified to display the marked points or segments. In the embodiment illustrated in
It will also be observed that the scope or range of the marked objects is visible on the scroll bar 32 by the width of low density segments 43.
In accordance with a further feature of the invention, means can be provided to execute a relativity controller function. This can be implemented automatically whenever a marking of salient points is made, or it can be implemented by, for example, pointing to the scroll, bar, clicking, and then dragging the mouse perpendicular to the scroll bar, or it can be implemented, as explained later, by clicking on a special button added to the scroll bar and then dragging the mouse. In the flow charts described later, an option key is also used when clicking on the scroll bar. When the relativity controller function is activated, the computer modifies the linear temporal representation of the sound into a non-linear representation, with the non-marked segments shrunken in time and the marked segments expanded in time into the resultant empty regions and thus magnified. This is also illustrated in
Note, further, in
Marking of the video can occur in the same manner as the audio, such as pressing a mouse button when the cursor is on the video to mark the beginning of a segment and releasing the button to mark the end of the segment. The resultant marks can be displayed on the video, or in the scroll bar, or on both.
In this aspect of the invention, not only is the user allowed to select and display the scope of salient segments, but as a further feature allows the user to vary the degree of magnification of the salient segments. It will also be understood that, besides size, other scroll bar changes can be used to represent the salient segments and/or different levels of magnification. For example, different colors can be used to represent on the scroll bar the salient and non-salient segments selected at different times or by different users, and if desired the intensity of the color used to illustrate level of magnifications.
The relativity controller application program will not interfere with the normal functions available in programs such as Apple QuickTime, while providing the additional functions described above. A listing of available functions for a preferred embodiment, which is not meant to be limiting, appears below, to be used with, for example, an EDIT menu as depicted in
The edit menu allows the user to perform the normal functions on displayed information, as well as the ability to remove any marks made by the user on the screen display or the scroll bar. what appears below is a description of functions available to the user to carry out the invention. One way of implementing these functions in software are shown in the program flow charts illustrated in
-
- Adjust position in movie time
- click or drag mouse in scroll bar
- Play & Pause movies from/to anywhere in movie (beginning<->end)
- press Play/Pause button
- Mark segments of movies (while playing or paused)
- press mouse (and hold down for duration of mark) within movie window
- Review individual marked segments
- click on a mark, and press Play Segment button
- Adjust relationship between marked and unmarked segments
- click on Scale Perspective button and drag mouse vertically
- (up into movie-> relative scaling; down out from movie-> absolute scaling)
- option-click on slider thumb and drag mouse vertically
- (one step navigation to/from specific point in time & resolution)
- click on Scale Perspective button and drag mouse vertically
- Remove 1 or all marks in movie
- click on a mark, and select Remove Marker (cmd-R) from Edit Menu
- click on a mark, gesture (press, drag, lift) left with mouse within mouse within movie window
- select Remove All Markers from Edit Menu
- Change current marking color
- use Apple ‘Color’ control panel to choose Selection Color
- (may facilitate collaboration of groups of people (diff. color per person))
- use Apple ‘Color’ control panel to choose Selection Color
- Copy frames or marked segments to the Macintosh clipboard & other applications
- select Copy Frame or Copy Segment (cmd-C) from the Edit Menu
- Save document marks and perspective
- select Save (cmd-S) from File Menu
- High Speed annotation:
- adjust Scale, press Play, then mark segments
- can also be used by scaling to fast-forward to stop when hits a marked segment.
- Adjust position in movie time
As is conventional in the Macintosh, the left button 83 (
Various features of the invention as well as modifications are also indicated below:
-
- Marks can be drawn inside scroll bar to keep desktop real estate usage down
- Marks can be colored to indicate different users or states of notation
- Scale Perspective cursor changes (while adjusting, scale) to reflect size of segments in scroller (also dynamically changing)
- Relationship between marked and unmarked segments is reflected in:
- speed of playback
- (unmarked segments, speed-up with perspective relativity)
- size of marks in scroll bar
- (unmarked segments shrink in proportion to play speed)
- (marked segments enlarge to fill the remaining scroll bar space)
- color of scroll bar area: indicating density of scale (looks like depth of field)
- (unmarked segments get darker in proportion to size in scroll bar & speed)
- (marked segments get lighter in proportion to size in scroll bar)
- speed of playback
- At more relative scale, user has higher resolution access to time in that area
- (moving scroll thumb passes through less frames per pixel)
- Overlapping marked areas join to form single marks (with 1 scope & 1 color if desired)
- if new mark falls between original startTime and endTime, then
- newColor=¼(new)+¾(orig.)
- if new mark overlaps original startTime or endTime, then
- newColor=(orig.+new)/2
- if new mark Overlaps both original startTime and endTime, then
- newColor=¾(new)+¼(orig.)
- if new mark falls between original startTime and endTime, then
- ‘Save’ menu item is enabled when user modifies marks or scale
- (perspective is part of the document)
- Marks & scale are saved inside movie files as QuickTime ‘user data’
- Mark data consists of scope (startTime, endTime) and color when chosen (RGB)
- Gives audio feedback when removing marks from segments
- Marked movies have unique ‘stamped movie’ icon on Desktop
Also listed below is a summary of a few data types with examples of how the invention can be applied:
Implementation of the various forms of the invention will be evident to those skilled in the art. Reference is made to “Inside Macintosh”, (published by Addsion-Wesley) which provides the code for developers for various kinds of interface constructs, such as scroll bars, control bars, slide controls, and boxes used therein, as well as how to display them in different colors or appearances, and how to invoke program routines when a user clicks on a box or icon, and how to change the appearance of an icon when a routine is executed. See, also, U.S. Pat. No. 4,931,783, which describes operation of a system with the Apple Graphical User Interface, whose contents are herein incorporated by reference.
To further assist those skilled in the art,
The person skilled in the art will have no trouble in understanding and implementing the flow charts illustrated. Virtually all of the statements printed in the flow chart boxes are understandable, and no need exists to repeat the text herein. However, certain statements require some explanation. The statements in the blocks indicated by double lines, such as block 85 in
To summarize some important aspects of the invention:
-
- Linear density of scroll bar (i.e. ruler), can be varied with document salience density;
- Scrolling rate can be varied with document content density;
- Amount of document in window can be varied with document salience density;
- The zoom control function can be implemented by clicking the scale perspective button and dragging perpendicular to scroll bar to zoom between perspectives:
- drag out from document->absolute scaling and
- drag towards document->relative scaling.
As a further alternative, the user can press an option key and click on the scroll bar, which will jump the thumb to the pointer position and simultaneously allow the user to scroll by moving the mouse horizontally and to change scale or magnification by moving the mouse perpendicularly (vertically) to the scroller. These changes will be visible on the screen display as well as on the scroll bar.
Since the program of the invention runs as an application, clicking on the document display can readily be used to add to the document data structure in memory the time or spatial position of the salient marked display portion when/where the pointer rested.
Marking data structures will be evident to those skilled in the art. For text documents, adding a mark is generally similar to adding a formatting or printing code to the stored text. Marking video is similar to text marking, except that remembering character position is replaced by remembering time position and storing it in the user data portion of the movie.
As further marking alternatives, for video, the mouse button for marking can be held depressed while the video plays and released to define a marking point or segment. For text, the salient text can be highlighted and a menu dropped to select a marking function.
Although there have been described what are at present considered to be the preferred embodiments of the invention, it will be understood that the invention may be embodied in other specific forms without departing from the essential characteristics thereof. The present embodiments are therefore to be considered in all respects as illustrative, and not restrictive. This scope of the invention is indicated by the appended claims rather than by the foregoing description.
Claims
1. In a computerized system having a computer, storage means for information represented by a sequence of displayable objects, a display for displaying such objects, and a user control device for said computer, said computer having means for accessing the storage means and for causing the display to display certain of the objects under control of the control device, the improvement comprising:
- (a) first means in the computer responsive to the user control device for selectively visibly marking on the display a first portion of the objects displayed for subsequent retrieval or emphasis.
2. The system of claim 1, further comprising:
- (b) second means for displaying alongside the displayed objects a scroll bar as part of the control device,
- (c) third means connecting the second means to the first means such that marking of the object display causes a corresponding change in the appearance of those scroll bar portions located opposite the marked objects.
3. In a computerized system having a computer, storage means for information represented by a sequence of displayable objects, a display for displaying such objects, and a user control device for said computer, said computer having means for accessing the storage means and for causing the display to display certain of the objects under control of the control device, the improvement comprising:
- (a) first means in the computer for displaying alongside the displayed objects a scroll bar as part of the control device,
- (b) second means in the computer responsive to the user control device for selectively marking on the scroll bar an area corresponding to a first segment of the objects displayed for future retrieval or emphasis.
4. The system of claim 3, further comprising:
- (c) third means connected to the second means for transforming the object display into shrunken and non-shrunken segments corresponding to the marked areas and for causing a corresponding change in the appearance of those scroll bar portions located opposite the marked objects.
5. In a computerized system having a computer, storage means for information represented by a sequence of displayable objects, a display for displaying such objects, and a user control device for said computer, said computer having means for accessing the storage means and for causing the display to display certain of the objects under control of the control device, the improvement comprising:
- (a) first means in the computer responsive to the user control device for selectively shrinking the display of a first segment of the objects displayed to emphasize the display of a second segment of the displayed objects.
6. The system of claim 5, further comprising:
- (b) second means for displaying alongside the displayed objects a scroll bar as part of the control device,
- (c) third means connecting the second means to the first means such that transformation of the object display into shrunken and non-shrunken segments causes a corresponding change in the appearance of those scroll bar portions located in proportion to the shrunken or non-shrunken object segments in the document.
7. The system of claim 6, wherein the third means comprises means for changing the density of the appearance of the scroll bar, the density changes corresponding to the shrunken and non-shrunken segments.
8. The system of claim 6, wherein the third means comprises means for changing the color or brightness at those portions of the scroll bar corresponding to the shrunken or non-shrunken segments.
9. The system of claim 6, further comprising a perspective scalar button on the scroll bar, said computer having fourth means responsive to activation of the perspective scalar button for modifying the level of shrinkage and thus the level of magnification of the non-shrunken segments.
10. The system of claim 6, wherein said fourth means is responsive to movement of the user control device in a direction perpendicular to the scroll bar.
11. The system of claim 5, wherein said control device includes a switch, a cursor is visible on the display, and said first means is responsive to the cursor being positioned on the displayed objects and to actuation of the switch to mark the second segment of the displayed objects.
12. The system of claim 6, further comprising a controller button on the scroll bar, said computer having fourth means for producing a cursor on the display whose position is controlled by the control device, said computer further having fifth means such that when the cursor is adjacent the controller button and the user activates a switch on the control device, sixth means are activated for controlling the level of emphasis of the second segment.
13. The system of claim 12, wherein the sixth means includes means for displaying a perspective scalar cursor whose size or appearance indicates the said level of emphasis.
14. The system of claim 5, further comprising the computer having fourth means in response to actions of the control device for marking a segment of the displayed objects to constitute the second segment.
15. The system of claim 14, further comprising the computer having fifth means to modify the appearance of the scroll bar to correspond in position to the location of the first and second segments.
16. The system of claim 14, further comprising the computer having fifth means under user control for varying the degree of shrinkage of the first segment.
17. A method of changing the display of related objects in a computerized system having a relativity controller to magnify selected ones of the objects, comprising the steps:
- (a) forming a normal display of a succession of the objects in which each object displayed occupies a part of the display in proportion to its size,
- (b) forming on the display alongside the succession of objects a scroll bar having an elongated area representing the beginning to the end of the succession of objects, a scroll thumb on the elongated area and whose location therein corresponds to a selected object of the succession of objects, and means associated with the scroll bar for activating a means for changing perspective scaling of the objects,
- (c) providing for a user an interactive cursor control device having user-activated switches,
- (d) providing on the display a cursor whose location is controlled by the control device,
- (e) actuating the control device to mark a first segment of successive objects and to display on the scroll bar elongated area a first indication of the marked segment,
- (f) activating the control device to activate the perspective scaling means to select a level of magnification of the marked segment thereby to cause objects not in the marked segment to occupy a smaller region of the display than they originally occupied to form additional regions adjacent the marked segment of objects and to cause the objects in the marked segment to expand to fill the additional regions.
18. The method of claim 17, wherein step (e) is repeated to form a second indication at a different location along the scroll bar to define a second marked segment, carrying out of step (f) causing objects not in the first or second marked segments to occupy smaller regions forming the additional regions into which the objects in the first and second marked segments can expand.
Type: Application
Filed: Oct 30, 2007
Publication Date: Jul 3, 2008
Inventor: Eric Justin Gould (Austin, TX)
Application Number: 11/978,964
International Classification: G09G 5/00 (20060101); G06F 3/038 (20060101); G09G 5/08 (20060101);