METHODS AND SYSTEMS FOR ALTERING VIDEO CLIP OBJECTS
The present disclosure relates generally to content delivery techniques in audio-visual streaming systems. The techniques include altering video or audio portions of media content based on user input or interaction. The techniques further include altering text or messaging distributed to multiple users based on user input.
This application claims priority to U.S. Provisional Patent Application No. 62/348,706, filed on Jun. 10, 2016, the disclosures of which are hereby incorporated by reference in their entirety for all purposes.
BRIEF DESCRIPTION OF THE FIGURESIllustrative embodiments are described in detail below with reference to the following figures:
In the following description, for the purposes of explanation, specific details are set forth in order to provide a thorough understanding of embodiments of the disclosure. However, it will be apparent that various embodiments can be practiced without these specific details. The figures and description are not intended to be restrictive.
The present disclosure relates generally to content delivery techniques in audio-visual streaming systems.
I. Streaming OverviewThe entities depicted in
One or more of the entities described herein can utilize one or more communication networks (e.g., communication network 115) to facilitate communications. The communication network 115 can include one or more networking devices or equipment including but not limited to network switches, routers, bridges, and load balancers. Examples of the communication network 115 can include the Internet, a wide area network (WAN), a local area network (LAN), an Ethernet network, a public or private network, a wired network, a wireless network, a virtual network, or any combinations thereof. Different communication protocols can be used to facilitate communications through the one or more communication networks, including wired and wireless protocols such as the IEEE (Institute of Electrical and Electronics Engineers) 802 suite of protocols, Internet Protocol (IP), TCP/IP (Transmission Control Protocol/Internet Protocol), UDP, RTSP, Asynchronous Transport Mode (ATM), frame relay network protocols, Multi-Protocol Label Switching (MPLS), OpenFlow, IPX (Internetwork Packet Exchange), SAN (Storage Area Network) protocols, AppleTalk, Bluetooth, Long-Term Evolution (LTE), and other protocols.
In an example, a client device of the client devices 102, such as a laptop or desktop computer 102B, can utilize a web browser application to access the audio-visual streaming system 110. In some embodiments, the web browser application can include FireFox, Safari, Chrome, or Internet Explorer. In other embodiments, the web browser application can use an application designed for (or operable using) devices having an operating system such as iOS or Android. In such embodiments, Hypertext Transfer Protocol Live Steaming (HLS) or Progressive/HLS can be used as a video delivery protocol. In some embodiments, Hypertext Transfer Protocol (HTTP) can be used for communications. In some embodiments, authentication tokens can be used for access. In some embodiments, expiring tokens can be used for content-specific URLs. In some embodiments, single-use content URLs, concurrent stream detection, geographic blocking, device invalidation support, secure application storage, and/or audio-visual output protection can be used.
II. Aggregated Video CommentsIn some embodiments, one or more video clips in a video clip object can be sent to client devices 102 as individual video files. In such embodiments, metadata associated with the individual video files can link the individual video files together for display. In some embodiments, client devices 102 can receive data separate from the individual video files. The data can describe the linkages between the individual video files. In some embodiments, when video objects are displayed at client devices 102, the client devices 102 can display one video clip after another. In some embodiments, one or more video clips in a video clip object can be graphically linked together into a string of videos, with the videos displayed by scrolling up, down, or sideways by user input and/or automatically.
Instead of (or in addition to) sending video clip objects as logically linked collections of video clips, video clip objects can be rendered into smaller collections of files for transmission and display. For example, in a system where question clips and response clips are displayed, the questions clips and the response clips can be rendered into a single video file (and/or multiple video files) for transmission and display. In such an example, the questions clips and response clips can be displayed at a client device by moving forward and/or backward in the video file.
The timeline 200 begins with an identification 220 and a transmission 225 of a video clip object by a server 202. The video clip object can include a video clip 290-1. In some embodiments, the server 202 can be similar to the audio-visual streaming system 110 from
In some embodiments, the video clip 290-1 can be generated on a client device (e.g., a mobile device 102A, a laptop or desktop computer 102B, and/or other devices capable of generating multimedia). In other embodiments, the video clip 290-1 can be produced by another entity, e.g., a video used to promote a multimedia offering (e.g., a movie and/or a television program). In some embodiments, the video clip 290-1 can be termed a “seed” clip—a question (e.g., “Show off your setup”) that is targeted to elicit responses. In some embodiments, a reponse can include one or more responsive video clips.
In some embodiments, the server 202 can identify the video clip 290-1 after the video is uploaded by a user, and/or retrieved from storage. After the video clip 290-1 is stored at the server 202, and handled by the server 202 as a video clip object, the video clip object can be transmitted using a network (e.g., the Internet), to a mobile device 205-1 and a mobile device 205-2 using a similar process as described above.
In some embodiments, the video clip object can be displayed (230-1) using a mobile device 205-1. For example, a user can select the video clip object for playback and watch the video. In this example, upon seeing the video with the seed question, the user can answer the question and/or comment. Using embodiments described herein, the user can record a response video clip 250-1. In this example, the response video clip 250-1 can be a short video showing content (e.g., a person speaking) that is in response to the video clip 290-1. In some embodiments, response video clip 250-1 can be transmitted to the server 202 (not shown), and the server 202 can receive (240-1) the response video clip 250-1. In some embodiments, the response video clip 250-1 can include metadata that associates the response video clip 250-1 with the video clip 290-1. The metadata can be created by a client device upon creation of the response video clip 250-1. While the response video clip 250-1 can be generated by a user, the video clip object can be viewed (230-2) by another user using a second mobile device (similar to a mobile device 205-2).
While
In some embodiments, while a User B is viewing and responding to a video clip, one or more other users can also view the video clip 290 on one or more other devices (e.g., a mobile device 205-2) (230-2). In such embodiments, a response video clip 250 can be generated (235-2), and the response video clip 250 can be transmitted to (not shown), and received (240-2) by the server 202. In some embodiments, after response video clip 250-1A is received, video clip object 295 can be altered by the server 202 to include the response video clip 250 (not shown) . As described above, some embodiments update a database that describes the content of video clip objects. Other embodiments render the two video clips together into a single file. And other embodiments perform a similar combination of actions. Some embodiments can also attach metadata to the response video clip 250 that links the response video clip 250 to the video clip 290.
As shown in
Because of a potentially large number of response video clips (similar to the response video clip 250) that can be received, some embodiments can improve an operation of the server 202 by automatically selecting a subset of all the response video clips (such as the response video clip 250) received. The selection can be performed by selecting the segments in order, selecting segments from certain system users, selecting random segments, and/or other ways of selecting video segments.
In addition, to improve system efficiency and user satisfaction, as discussed above, the response video clips (similar to the response video clip 250) can be automatically sorted to rearrange the order in which the response video clips (such as the response video clip 250) are ordered for presentation, e.g., the logical order discussed above, or the order video clips appear in a video file. It should be appreciated that different orders for video clips can be set at any time, for any reason, e.g., user preferences, location, upvotes, and/or other relevant factors. Different orders for video clips can be automatically set for each user, based on the respective characteristics of a user.
In some embodiments, user system 860-1 can transmit a message video clip object 840-1 to user system 860-2. In some embodiments, the message video clip object 840-1 can include a video clip 830-1. The video clip 830-1 can include content (e.g., “How are you?”) that contains a message directed to one or more specific user systems (e.g., user system 860-2). User system 860-2 can receive the message video clip object 840-1. In some embodiments, user system 860-2 can identify a reply message video clip 830-2, in response to the message video clip object 840-1. In some embodiments, the reply message video clip 830-2 can include content that contains a reply message directed to the user system that sent the message video clip object 840-1 (e.g., “I am fine, what are you up to?”), such reply being generated by user system 860-2 (e.g., using the camera on a mobile device).
In some embodiments, the audio-visual streaming system 110 can alter the message video clip object 840-1 to include the reply message video clip 830-2, resulting in a message video clip object 840-2. In some embodiments, the alteration can occur at the user system 860-2. In some embodiments, the video clip 830-1 and the reply message video clip 830-2 are not altered, rather they are logically linked in the message video clip object 840 such that they can be presented by the audio-visual streaming system 110 in the sequence shown. Continuing for communications 850-2, and 855-2, in some embodiments, the message video clip object 840 increases in size during each pass.
In communication 850-3, user system 860-1 can be configured to reorder the video clips 830 of message video clip object 840-5. It should be appreciated that the message video clip object 840, composed of video clips 830, can be automatically reordered by embodiments based on different criteria. For example, in some embodiments, video clips 830 can be prioritized. User system 860-1 can be configured to place a video clip 830-5 at the beginning of a message video clip object 840-5 based on an assigned priority.
In some embodiments, at the end of a sequence of communication (e.g., communications 850, 855) message video clip object 840-5 can be stored 865 (e.g., in storage 860) for later retrieval and viewing.
In some embodiments, the communications 850, 855 can be point-to-point communications. In other embodiments, the communications 850, 855 can be point to multipoint (e.g., a third user system also receives the contents of one or more communications 850 and/or 855). In other embodiments, the communications can be multipoint to multipoint. In some embodiments, communications 850, 855 are not to be considered broadcast communications (e.g., the message video clip 840 is not available to the Internet and/or user systems to which communications 850, 855 were not directed).
IV. Inset Video CommentingReturning to
In some embodiments, as shown with the combination of second video clip 985-4 and first video clip 925-4, the first video clip 925-4 can be altered by rendering the second video clip 985-4 to be displayed at the position specified by the position metadata described above. In some embodiments, the position metadata (not shown) can be stored as a part of one of the video clips, or can be stored for use in a database. As described above, this approach can be used for any of the embodiments having multiple video clip described herein.
In some embodiments, as shown with first video clip 925-5 and second video clip 985-5, both video clips can be stored separately and, upon request to be played by a requesting user system, both video clips can be sent out separately, and rendered together at the user system. In such embodiments, the alteration of the first video clip 945-1 can occur at the user system, under the direction of server 910. This approach can also be used for any of the embodiments having multiple video clip described herein.
In some embodiments, to create the animated pictographs 1230, a user (before, during and/of after a video 1225 is displayed) can select an “Add a Reaction” user interface button, and a recording can begin. In some embodiments, when the “Add a Reaction” user interface button is selected, a reaction can be displayed in window 1240. Once the recording is completed and the video clip is transformed, the animated pictographs 1230 can be stored for further use. In some embodiments, the animated pictographs 1230 can be used for a variety of application uses, e.g., the video clip comments described above with the descriptions of
In some examples, a user A 1430, before, during, and/or after recording the video clip 1425, can select and configure an overlay that requests a user viewing the video clip 1425 to select an answer to a question. Configuring the overlay can include specifying the text of the question, the number of choices, and the text associated with each choice 1422. Similar to the process described with
As would be appreciated by one having skill in the relevant art(s), given the description herein, the selections of choices (such as choice 1422) by viewing users can be collected at an audio-visual streaming system (e.g., similar to audio-visual streaming system 110 from
Conventionally, messages received by a user from one or more users in a group are organized in a temporal fashion. In particular, messages are organized from a least recent to a most recent with no regard to content of each message. Such organizations can cause difficulties with keeping track of different topics in the group. For example, a first user in the group can post a first topic that includes a question. A second user can then post a second topic that includes another question. Both topics can be arranged one after another. Then, an answer by a third user can be posted after the two topics. However, the answer can be ambiguous as to which question the answer is associated with.
Embodiments herein allow users to designate a new topic within a messaging application when sending a message. For example, a user can select that the user wants to create a new topic by selecting that the message is a topic message. Other messages in the messaging application can be replies to one or more topic messages. In the description herein, a topic message is also referred to as a text card. By using text cards, messages can be organized based on groups of messages. For example, a text card can allow a user to send one or more users the text card such that one or more messages associated with the text card are kept together with the text card.
A. Creating a Text Card
A user can create a text card by indicating to an application that the user wants to create the text card. For example, the application can include a button that allows the user to create a text card. The indication can be sent to a system hosting the application. In some examples, an object can be created on the system to represent the text card. The object can link the text card with the user and one or more other users that the user wishes to share the text card with. By linking the text card with the one or more other users, the text card, and messages associated with the text card, can be viewed by the one or more other users. The text card can appear to the one or more other users as a message in a thread, or conversation. In some embodiments, a first user of the one or more other users can be linked to the object by the object including an identifier of the first user in the object.
In some examples, rather than linking the object directly to the one or more other users, the user can choose to send the text card to one or more groups. The one or more groups can be selected when the user creates the text card. Each group can include one or more pre-identified users. In some examples, the group can be created by the user, by a second user, or be automatically created by the application based on similar interests or characteristics of users. In such examples, the object can be linked to a group by including an identifier of the group in the object.
When the text card is posted, the users in the group can view the text card. In some examples, a thread (or conversation, or an interaction between the users) can include two or more text cards. In such instances, the thread can list the two or more text cards in a temporal fashion. For example, a first text card that was posted first can be in a position indicating that the first text card was posted least recent. In such an example, a second text card can be posted such that the second text card is in a position indicating that the second text card was posted after the first text card.
In some examples, the first text card and the second card can be associated with a single conversation and different topics. For example, the screen 1520 can include a first text card 1530. The first text card 1530 can be posted by a first user (e.g., Erin). The first text card 1530 can be associated with a first submission box 1545. The first submission box 1545 can allow a user to respond to content of the first text card 1530. The content of the first text card 1530 can includes a message (e.g., “Who wants to go on a hike?”).
The screen 1520 can further include a second text card 1540. The second text card 1540 can be posted by a second user (e.g., Jeff). The second user can be the same as or different from the first user. The second text card 1540 can include content. For example, the content of the second text card 1540 can include a message of “Who wants to go to the movies?”.
B. Posting a Reply
After a user views a text card, the user can reply to the text card. The reply to the text card can be associated with the text card such that the text card and the reply are grouped together on the screen 1520 for others to view. In some examples, a reply, or a link to the reply, is added to the object associated with the text card.
Looking at
The reply 1550 can be visually indicated as a most recent post to the one or more text cards (e.g., the conversation between people). For instance, the visual indication can include a position on a screen (e.g., the most recent post can be located at a bottom of a screen) or a physical transformation (e.g., a color of the reply can change color). The visual indication can also include both position and a physical transformation. A most recent text card, and/or post, to the screen 1520 can also include a visual indication.
By grouping responses to a text card, a user can easily follow an idea or thread in a visual manner. Rather than having to decide whether the reply 1550 of Steve is associated with the first text card 1530 or the second text card 1540, a user can easily see that the reply 1550 of Steve is associated with the second text card 1540.
When a new text card is posted to screen 1520, the new text card can include a visual indication that the new text card is a most recent text card. For example, the new text card can be put on a bottom of a list of text cards. When a new reply is posted to a screen 1520, the text card that is either directly or indirectly associated with the reply can be moved to a position indicating the text card is the a most recent text card. For example, the text card can be inserted at a bottom of a list of text cards. For example, when the second reply 1560 was posted in
Because in the example above there are only two text cards, the switch causes the second text card 1540 to go to the top of the list of the text cards; however, if there are three or more text cards, moving a text card to the bottom would not cause the lowest text card to move to the top of the list, but rather to one spot higher in the list than it previously was. For example, if there are three text cards (A, B, and C), and a reply is posted to A, the new order of the text cards would be B, C, and A. In other words, when an update to the screen 1520, or a thread of text cards, is posted, the update would show up at the bottom of the thread with the related messages.
Process 1600 is illustrated as a logical flow diagram, the operation of which represent a sequence of operations that can be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations can be combined in any order and/or in parallel to implement the processes.
Additionally, the process 1600 can be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code can be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium can be non-transitory.
At step 1605, the process 1600 includes receiving a first topic message to be posted on a thread in an application. The first topic message can be received at a server from a first device. The thread can be associated with an interaction between a plurality of users. The application can be executing on a mobile device for each of the plurality of users.
At step 1610, the process 1600 further includes posting the first topic message to the thread for the plurality of users to view. The topic message can be posted by the server. At step 1615, the process 1600 further includes indicating visually that the first topic message is a most recent post when the first topic message is posted to the thread. A visual indication can include highlighting the first topic message in a different color or moving the first topic message to a first position in a list.
At step 1620, the process 1600 further includes receiving a second topic message to be posted on the thread. The second topic message ca be receved by the server.
At step 1625, the process 1600 further includes posting the second topic message to the thread for the plurality of users to view.
At step 1630, the process 1600 further includes indicating visually that the second topic message is the most recent post when the second topic message is posted to the thread. For example, the first topic message can be unhighlighted when the second topic message is highlighted. For another example, the second topic message can be posted below the first topic message.
At step 1635, the process 1600 further includes receiving a reply to the first topic message. The reply can be received by the server. At step 1640, the process 1600 further includes associating the reply with the first topic message. Associating the reply with the first topic message can mean grouping the two messages together in the thread such that the two messages are physically near each other.
At step 1645, the process 1600 further includes posting the reply to the threat for the plurality users to view. The reply can be posted by the server At step 1650, the process 1600 further includes indicating visually that the first topic message is a most recent topic message when the reply is posted to the thread.
At step 1655, the process 1600 further includes indicating visually that the reply is the most recent post when the reply is posted to the thread. For example, the second topic message can be unhighlighted when the reply is highlighted. The visual indications mentioned above can be different for each step.
VIII. Management of ContentConventionally, content posted by a user to groups of users require a substantial amount of manual interaction by the user. For example, a user must determine a title for the content. In addition, once content is posted, the user is typically unaware how other users use the content. For example, if content is shared to other people, the original poster might not ever know. Even when the user wants to remove the content because of a request by another user, the user must manually find the content and navigate a deletion procedure.
Embodiments herein allow users to let an application determine a title for content. For example, the application can extract words from content to identify a quick title that can be used for identification purposes. Embodiments herein also allow users to know who is sharing content that they posted as well as provide an easier way for requesting content to be deleted.
A. Delete Request
The delete request button 1740 can cause a system associated with the application to send a request to the mobile device 1710.
If the second user indicates to delete the content 1730 (e.g., by pushing the yes button 1760), the application can automatically remove the content 1730 from the application. In some instances, the yes button 1760 can be bound to the content 1730, such that the yes button 1760 can delete the content 1730. Once the content 1730 is deleted, the first user, or any other user on the application, might no longer be able see the content 1730 on the application. In other instances, by removing the content 1730 from the application, the content 1730 might no longer be loaded for the application. In such instances, a mobile device that already is viewing the content 1730 can continue to view the content 1730.
In some instances, the second user can access a secondary menu on the content 1730. The secondary menu can provide a feature called “request delete.” Upon clicking “request delete,” the second user can include a message (e.g., “Hey, I do not like the shirt that I am wearing in the video.”). The second user can then send the request with the message to the first user. After the request is sent, the second user can see that they have sent a request. For example, a text for the feature called “request delete” can change to “delete request sent.”
The first user can receive a notification that the second user requests the first user delete the content 1730. The content 1730 can be identified by a title, as described below. The notification can include a link to the content 1730 so that the first user can view what content the second user wants the first user to delete. The message provided by the second user can also be included in the notification.
The notification can include a delete and/or keep button, which can both be included in the link provided in the notification. Clicking delete can delete the content from the application and send a second notification back to the second user informing the second user that the content has been deleted. Clicking keep can send a third notification to the second user. The third notification can inform the second user that the first user decided not to delete the content. The third notification can be sent after a particular amount of time has passed since the notification was sent.
Two or more users can request that the first user delete the content 1730. In such instances, the notification sent to the first user can include an aggregated list of the two or more users that requested that the first user delete the content 1730. In addition, the notification associated with the first user keeping or deleting the content 1730 can be sent to the two or more users.
B. Title Determination
In some instances, text overlaid on the content 1830 can be extrapolated starting from the top left of the content and progressing left to right and top to bottom. For example, a user can input text that includes “SHOW OFF your SETUP” in the content 1830. In such an example, the input text can be identified as a title of the content 1830. By assigning the input text as the title, a user receiving the content can receive a notification that states: “A user just posted a video, SHOW OFF your SETUP.” For another example, a user can insert “hot” in the top left corner of a content and “day” in the bottom right corner of the content. The title of the content might be “hot day.” The title can also be used to identify the content 1830 for other operations described herein, such as requesting to delete content.
C. Share Notification
When a first user shares a post of a second user to a third user or a group of users, the second user can be notified with the share notification 1930 that the first user shared the post. When the third user, or a member of the group of users, further shares the post, the second user can again be notified by a notification similar to the share notification 1930. In some instances, the first user can also be notified when the third user, or a member of the group of users, shares the post. The share notification 1930 can be received by the mobile device 1905. The share notification 1930 can be displayed on a screen 1920 of the mobile device 1905.
The share notification 1930 can include a name of the application that was used to share the post. The share notification 1930 can further include a message 1950. The message 1950 can include an identification of a user that shared the post and a user that received the shared post (e.g., “Jeff shares with Erin”). The message 1950 can further include one or more buttons. The one or more buttons can include a “Close” button 1960 that dismisses the share notification 1930. The one or more buttons can include a “View” button 1970 that allows a user to view what was shared. In other instances, the “View” button 1970 can allow a user to view at least one or more of a user that shared the post and a user that received the shared post.
The share notification 1930 can also include an identification of the post (e.g., “SHOW OFF your SETUP”). The identification of the post can be generated based on a title determination as discussed herein.
IX. Audio Magic WordsA system can detect whether a particular one or more words are present in a video. If the one or more words are detected, visual and/or audio effects can be applied to content (e.g., picture, video, message, etc.) based on the one or more words. For example, the words “Happy Birthday” in an audio portion of a video can trigger an effect to the audio portion or a visual portion of the video. A visual effect can include adding additional content to be displayed on top of the content. An audio effect can include changing or adding sound to content to change what a user hears when viewing the content.
In some instances, after searching the audio portion for a phrase of one or more words, a server, or application, can access a table to determine one or more effects that can be applied to the video. The table can be associated with the phrase identified in the audio portion. The one or more effects can correspond to the phrase identified in the audio portion.
In some instances, the system can detect a sentiment that is associated with one or more words. The sentiment can be determined from one or more words that were extracted from an audio portion in a number of ways known by a person of ordinary skill in the art. See e.g.,
Kaushik, L., Sangwan, A., & Hansen, J. H. (2013). Sentiment extraction from natural audio streams. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. doi:10.1109/icassp.2013.6639321. For example, rather than just looking for the word “Disneyland,” the system can identify that the user did not have a good time at Disneyland. The system might determine that a user says “Disneyland was horrible.” In such an example, an effect might not be applied to the video.
The one or more effects from the table can be provided to a user of a mobile device. The user can select a first effect of the one or more effects to be applied to the video. By selecting the first effect, a notification can be sent to the server, or the application, that includes an identification of the effect selected. The server, or the application, can then apply the effect to at least one or more of the visual portion and the audio portion of the video.
In some instances, a user can flip through one or more possible effects associated with the phrase. For example, if a system detects “Happy Birthday,” the user might be able to flip through fireworks, a cake, balloons, a high pitched voice, and other effects that deal with a birthday. In some instances, a left arrow and a right arrow (or a left swipe left and/or a right swipe on a screen of a mobile device) can be used to flip through different possibilities. In other instances, a menu can be generated for a user to pick an effect from a list of one or more effects.
A. Visual Effects
Process 2100 is illustrated as a logical flow diagram, the operation of which represent a sequence of operations that can be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations can be combined in any order and/or in parallel to implement the processes.
Additionally, the process 2100 can be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code can be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium can be non-transitory.
At step 2110, the process 2100 includes receiving a video. In some examples, the video can be received by a server that is hosting an application. In other examples, the video can be received by an application that is installed on a mobile device. In some examples, the video can be recorded on a mobile phone. The video can includes at least one or more of a visual portion and an audio portion.
At step 2120, the process 2100 further includes searching the audio portion for a phrase of one or more words. The searching can be performed by an audio recognition software that can transcribe at least a portion of the audio portion. The transcription can then be searched for the phrase. Other methods of searching the audio portion for the phrase can be performed. For example, one words can be transcribed at a time. When a word matches at least a portion of the phrase, the surrounding words can be combined with the word to see if the phrase is present in the audio portion.
At step 2130, the process 2100 further includes accessing a table associated with the phrase to determine one or more effects that can be applied to the video. The table can be stored on the server. In other embodiments, the table can be stored remotely from the server. In such embodiments, the table can be accessible from the server. The one or more effects can correspond to the phrase identified in the audio portion.
At step 2140, the process 2100 further includes providing the one or more effects to a user of a mobile device. The one or more effects can be provided by a list of possible effects. In other examples, an effect of the one or more effects can be performed on the video to show a user what would be the result. In such embodiments, a user can cycle through the one or more effects to see what other effects can be used on the video. The mobile device can be the same or different than the mobile device that recorded the video.
At step 2150, the process 2100 further includes receiving a selection of a first effect of the one or more effects. The first effect can change at least one or more of the visual portion and the audio portion of the video. If the first effect is associated with the audio portion, applying the first effect to the audio portion can change how the audio portion sounds. If the first effect is associated with visual portion, applying the first effect to the visual portion can change how the video looks.
At step 2160, the process 2100 further includes applying the first effect to the video. Applying the first effect to the video can include matching the audio portion after the first effect is applied to the audio portion before the first effect is applied. Matching can include ensuring that the audio portion is the same length of time before and after the first effect is applied.
B. Audio Effects
A user can swipe a screen 2220 of the mobile device 2205 to have a different audio effect applied to the audio, or the audio portion (e.g., Darth Vader, chipmunk, high pitch, slow, reverse, or any other altered sound). In other instances, the user can pick a type of audio effect using a radio button (e.g., radio button 2052). The radio button can be associated with a form (e.g., Darth Vader 2050, Chipmunk 2060, High Pitch 2070, Slow 2080, and Reverse 2090). Other examples of audio effects can include sounding like an old person and a young person.
The audio effects available can be based on one or more words identified in an audio, or audio portion. For example, the system can detect the words “I am your father.” In such an example, the system can apply an audio effect in order to make the voice sound like Darth Vader.
The audio can change based on changing one or more audio waves associated with the audio. When changing the one or more audio waves for a video, timing of the audio can be augmented to maintain synchronous with a video portion of the video. Maintaining synchronous can include ensuring that the audio last a particular amount of time. For example, sounding like a chipmunk can be achieved by speeding up the audio. However, if the audio is merely sped up, the audio will not match the visual of the mouth. Therefore, the audio waves can be manipulated in a way that does not change the length of time of the audio.
The server can identify one or more people speaking in audio so that the server can augment only individual people. For example, the server can change one voice to sound like Darth Vader while changing another voice to sound like a chipmunk. In other instances, the server can distinguish between a voice of a person and other voice (e.g., ambient noise, machine noise, non-human noise, etc.).
X. Scroll RecognitionThe scroll recognition can be based on a total length of the list. For example, if there are nine items in a list, the scroll recognition might not activate. However, when there are thousands of items in a list, the system can implement the scroll recognition. The scroll recognition can be based on a change of behavior of a user. The change of behavior can include a velocity of a swipe, an amount of time a user has been swiping, whether or not the user intercepted a swipe, other characteristics that can indicate that the user desires to skip two or more items in a list, or any combination therein. The scroll recognition can use multiple characteristics described herein. For example, the scroll recognition can be based on a total length of the list, a velocity of a swipe, and an amount of time a user has been swiping.
The scroll recognition can be based on a velocity of a swipe. In such instances, the velocity of a swipe can be compared to one or more velocities of one or more previous swipe to determine whether the velocity of the swipe is quick in comparison. A high velocity can be determined based on at least one or more of device type, rendering speed, and finger speed.
The scroll recognition can be based on an amount of time a user has been swiping in a consistent direction. For example, if an user has swiped up 10 times, or for a period of time, the scroll recognition can activate. The scroll recognition can also be based on an amount of time a single swipe takes to complete. The amount of time the single swipe takes to complete can be calculated by determining a first time when a finger is touching a screen and a second time when a finger is no longer touching the screen. The amount of time can equal the difference between the first time and the second time.
The scroll recognition can be based on whether a user has intercepted a scroll. For example, if a user is allowing the system to scroll the full length of a swipe, the system can determine that the user desires to skip one or more items in a list.
When the scroll recognition is activated, a current scroll can slow down, or stop, and allow the user to pick from a list of one or more choices. The list of one or more choices can include a number of items to skip. For example, the choices can include jump 100 items, jump 100 items, go back to the start of the list, jump a day, or any other number that can be transformed into a number of items to skip.
XI. Group-Based SubmissionA user can receive an option to send to other groups even when the user is in a conversation. In such instances, the user can attempt to post content to one or more groups or conversations. And, rather than merely posting to the group or conversation that the user is currently in, the user can receive an option to post to one or more groups. The one or more groups can include, or exclude, the group or conversation that the user is currently in. In some instances, the post can automatically be posted in the current group or conversation.
In some embodiments, instead of merely posting a copy of original content to one or more locations, group-based submission can allow a user to post the original content to each of the groups. In such embodiments, the original content would not be posted to a particular location, that is later shared to other groups.
When content is received by a user, the user cannot tell whether the content was sent to other users. In such instances, the content can be sent to each user individually even though the user that posted the content posted to the group as a whole. In some instances, a user in a first group cannot cannot tell whether the content was sent to a second groups. In other instances, the users in a group can see that the other users in the group received the content.
XII. Example Subsystems and ComponentsIt should be understood that any of the embodiments of the present disclosure can be implemented in the form of control logic using hardware (e.g. an application specific integrated circuit or field programmable gate array) and/or using computer software with a generally programmable processor in a modular or integrated manner. As used herein, a processor can include a multi-core processor on a same integrated chip, or multiple processing units on a single circuit board or networked. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will know and appreciate other ways and/or methods to implement embodiments of the present disclosure using hardware and a combination of hardware and software.
Any of the software components or functions described in this application can be implemented as software code to be executed by a processor using any suitable computer language such as, for example, Java®, C++, C, Python, or Perl using, for example, conventional programming techniques, functional programming techniques, or object-oriented techniques. The software code can be stored as a series of instructions or commands on a computer readable medium for storage and/or transmission, suitable media include random access memory (RAM), a read only memory (ROM), a magnetic medium such as a hard-drive or a floppy disk, or an optical medium such as a compact disk (CD) or DVD (digital versatile disk), flash memory, and the like. The computer readable medium can be any combination of such storage or transmission devices.
Such programs can also be encoded and transmitted using carrier signals adapted for transmission via wired, optical, and/or wireless networks conforming to a variety of protocols, including the Internet. As such, a computer readable medium according to an embodiment of the present disclosure can be created using a data signal encoded with such programs. Computer readable media encoded with the program code can be packaged with a compatible device or provided separately from other devices (e.g., via Internet download). Any such computer readable medium can reside on or within a single computer program product (e.g. a hard drive, a CD, or an entire computer system), and can be present on or within different computer program products within a system or network. A computer system can include a monitor, printer, or other suitable display for providing any of the results mentioned herein to a user.
Any of the methods described herein can be totally or partially performed with a computer system including one or more processors, which can be configured to perform the steps. Thus, embodiments can be directed to computer systems configured to perform the steps of any of the methods described herein, potentially with different components performing a respective step or a respective group of steps. Although presented as numbered steps, steps of methods herein can be performed at a same time or in a different order. Additionally, portions of these steps can be used with portions of other steps from other methods. Also, all or portions of a step can be optional. Additionally, any of the steps of any of the methods can be performed with modules, circuits, or other means for performing these steps.
The specific details of particular embodiments can be combined in any suitable manner without departing from the spirit and scope of embodiments of the disclosure. However, other embodiments of the disclosure can be directed to specific embodiments relating to each individual aspect, or specific combinations of these individual aspects.
The above description of exemplary embodiments of the disclosure has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the disclosure to the precise form described, and many modifications and variations are possible in light of the teaching above. The embodiments were chosen and described in order to best explain the principles of the disclosure and its practical applications to thereby enable others skilled in the art to best utilize the disclosure in various embodiments and with various modifications as are suited to the particular use contemplated.
A recitation of “a”, “an” or “the” is intended to mean “one or more” unless specifically indicated to the contrary.
Claims
1. A method of altering video clip objects by a first computing device, the method comprising:
- transmitting from the first computing device, a video clip object comprising a first video clip to a first plurality of computing devices;
- receiving at the first computing device, a second video clip from at least one of the first plurality of computing devices in response to said first video clip of the first video clip object; and
- altering by the first computing device, the first video clip object to include the first video clip and the second video clip, wherein said altering yields an altered video clip object.
2. The method of claim 1 further comprising transmitting the altered video clip object to a second plurality of computing devices from the first computing device.
3. The method of claim 1 further comprising receiving at the first computing device, a response video clip from each of the first plurality of computing devices, resulting in a plurality of response video clips.
4. The method of claim 3 further comprising:
- selecting a set of response video clips from the plurality of response video clips;
- identifying a sequence for the set of response video clips; and
- producing a sequenced set of video clips from said identifying.
5. The method of claim 4 further comprising altering the video clip object to include the sequenced set of response video clips, wherein said altering yields an altered second video clip object.
6. The method of claim 4, wherein identifying the sequence for the set of response video clips comprises determining the sequence based on characteristics associated with one or more computing devices of the first plurality of computing devices.
7. The method of claim 1 further comprising generating the video clip object in response to obtaining the first video clip at the first computing device.
8. The method of claim 1, wherein receiving the second video clip comprises receiving metadata indicating an association with the video clip object.
9. The method of claim 8 further comprising joining the second video clip with the first video clip in the video clip object based on said metadata of the second video clip.
10. A method of combining video clips, the method comprising:
- displaying a first video clip on a first computing device comprising a camera;
- recording a second video clip with said camera during said displaying of the first video clip;
- combining the first video clip and the second video clip by one or more of:
- rendering the first video clip and second video clip together, resulting in a third video clip where the second video clip is rendered in a window in the first video clip; and
- transmitting the first video clip and second video clip to a second computing device after said displaying ends.
11. The method of claim 10 further comprising stopping recording of the second video clip with said camera in response to concluding said displaying of the first video clip.
12. The method of claim 10 further comprising displaying the first video clip and the second video clip together at the second computing device in response to said transmitting.
13. The method of claim 10, wherein recording the second video clip comprises collecting metadata about a position of the second video clip in relation to playback of the first video clip.
14. The method of claim 13, wherein said combining comprises aligning frames of the second video clip with frames of the first video clip based on said metadata.
15. A method for applying a visual effect to a video based on an audio portion, the method comprising:
- receiving a video comprising a visual portion and an audio portion;
- searching the audio portion for a phrase of one or more words;
- matching said phrase to one or more effects that can be applied to the video;
- providing the one or more effects to a user of a mobile device;
- receiving a selection of a first effect of the one or more effects, wherein the first effect changes at least one or more of the visual portion and the audio portion of the video; and
- applying the first effect to the video.
16. The method of claim 15, wherein said applying comprises inserting the first effect as new sound to the audio portion of the video.
17. The method of claim 16, wherein said applying further comprises synchronizing playback of said new sound from the first effect to a playback position of said phrase in the audio portion.
18. The method of claim 15, wherein said applying comprises overlaying graphics onto the visual portion of the video.
19. The method of claim 18, wherein said applying further comprises synchronizing playback of said graphics from the first effect to a playback position of said phrase in the audio portion.
20. The method of claim 15, wherein said applying comprising modifying an audio segment of the audio portion about the phrase with a sound filter selected as the first effect.
Type: Application
Filed: Jun 9, 2017
Publication Date: Dec 14, 2017
Patent Grant number: 11386931
Inventors: Jason A. Kilar (Hillsborough, CA), Richard W. Tom (San Francisco, CA), Lonn Lee (Belmont, CA), Bao Lei (San Bruno, CA), Zachary Pinter (San Francisco, CA), Lindsay Monroe (San Francisco, CA), Yilli Aiwazian (San Francisco, CA)
Application Number: 15/618,280