METHOD AND APPARATUS FOR MULTIMEDIA CONTENT FILTERING
Method and apparatus for multimedia content filtering are described herein. In one embodiment, an example of a network access device, in response to multimedia content transmitted from a source over a first network and destined to a destination over a second network, opens the multimedia content within the network access device interfacing the first and second networks. A content rating operation is performed on the opened multimedia content to determine whether the multimedia content should be transmitted to the destination over the second network. Other methods and apparatuses are also described.
This application is a continuation and claims the priority benefit of U.S. patent application Ser. No. 11/236,280 filed Sep. 26, 2005, issuing as U.S. Pat. No. 9,275,047, the disclosure of which is incorporated herein by reference.
FIELD OF THE INVENTIONThe present invention relates generally to content filtering. More particularly, this invention relates to multimedia content filtering.
BACKGROUNDToday, various content filtering mechanisms are provided to entities to manage and/or control user access to the Internet via facilities provided by the entities. For example, a company typically implements some forms of content filtering mechanisms to control the use of the company's computers and/or servers to access the Internet. Access to content within certain predetermined categories using the company's computers and/or servers may not be allowed during some predetermined periods of time.
For example, a typical content filtering client, which typically resides within a firewall, sends a request for the content rating of a web page in response to each web page browsed. The content rating requests are routed to a separate content rating server. When the content rating server receives a request, the content rating server retrieves the content rating for that request from a database and sends the content rating to the content filtering client. Based on the content rating retrieved, the content filtering client determines whether the user is allowed to access the web page. If the user is allowed, the content filtering client passes the web page. Otherwise, the content filtering client blocks the web page.
As the amount and type of multimedia content on the Web keep growing, the ability to access a large amount of multimedia content from the Web raises new questions on how to classify or rate such content, in order to provide content security. Although conventional content filtering techniques work well on text-based content, such techniques are not able or work poorly to perform content filtering on multimedia content, such as, for example, audio and/or video content.
SUMMARY OF THE DESCRIPTIONMethod and apparatus for multimedia content filtering are described herein. In one embodiment, an example of a network access device, in response to multimedia content transmitted from a source over a first network and destined to a destination over a second network, opens the multimedia content within the network access device interfacing the first and second networks. A content rating operation is performed on the opened multimedia content to determine whether the multimedia content should be transmitted to the destination over the second network.
Other features of the present invention will be apparent from the accompanying drawings and from the detailed description which follows.
The present invention is illustrated by way of example and not limitation in the figures of the accompanying drawings in which like references indicate similar elements.
Method and apparatus for multimedia content filtering are described herein. In the following description, numerous details are set forth to provide a more thorough explanation of embodiments of the present invention. It will be apparent, however, to one skilled in the art, that embodiments of the present invention may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form, rather than in detail, in order to avoid obscuring embodiments of the present invention.
Reference in the specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. The appearances of the phrase “in one embodiment” in various places in the specification do not necessarily all refer to the same embodiment.
Accordingly, multimedia processing techniques are utilized to perform content filtering and/or content rating on multimedia content. For example, if network traffic going through a network appliance, such as a network access device or gateway, includes an audio stream, speech recognition provides a way to classify the audio stream that is being downloaded or played over a network (e.g., Internet). According to one embodiment, if the network traffic includes, for example, an audio stream detected by a network access device, the detected audio stream may be extracted from the network traffic by the network access device before the audio stream reaches a destination. The extracted audio stream may be fed to a speech recognition unit to classify or identify a pattern or patterns of the audio stream and a content rating is performed on the recognized audio stream. Thereafter, the network access device may decide whether the audio stream may be transmitted to the destination based on the content rating.
In one embodiment, the classification of the multimedia content may be performed based on a vocabulary or on detecting semantics in the multimedia content. The content categories may be used to allow or block user access to various types of content, which may be tailored around specific users, groups, IP subnets, time of day or week or other relevant policies that may apply.
Note that throughout this application, an audio stream is used as an example of multimedia content for the purposes of illustration. But it is not so limited; it will be appreciated that other types of multimedia content, such as, for example, images and/or video streams, may also be applied. Also note that throughout this application, a network access device (e.g., a gateway device) is used as an example to perform the techniques described herein for the purposes of illustration. But it is not so limited; it will be appreciated that other devices may also be used to perform the techniques described throughout this application.
In one embodiment, network access device 101 includes, but is not limited to, a multimedia processing unit 105, in response to multimedia content (e.g., an audio stream) transmitted from the source 102 over a first network 109 (e.g., WAN) and destined to the destination 103 of a second network 110 (e.g., LAN), to open the multimedia content within the network access device 101. The network access device 101 further includes a content filtering unit 104 coupled to the multimedia processing unit 105 to perform content rating on the opened multimedia content to determine whether the multimedia content should be transmitted to the destination over the second network 110.
The above operations may be performed in real-time (e.g., on the fly) by the network access device 101. The above operations may be performed without letting some or all of the multimedia content to reach the destination (e.g., node 103). In one embodiment, the network access device 101 may further include an optional buffer or buffers 106 to temporarily store the multiple media content while the above operations are being performed.
In one embodiment, if the multimedia content includes an audio stream for example, a speech recognition engine may be invoked to perform recognizing a pattern or patterns of the audio stream. For example, the speech recognition unit may recognize or identify certain patterns within the audio stream. As a result, the content filtering unit 104 (e.g., content rating unit) may perform content rating on the recognized patterns. Thereafter, the content filtering unit 104 may decide whether to allow the multimedia content to be passed to the destination 103 based on the content rating on the audio stream.
In one embodiment, the audio patterns may be determined based on at least one of vocabulary and/or semantics of the recognized patterns. In another embodiment, the audio patterns may be classified based on at least one of users and/or groups associated with the destination and/or source nodes, IP subnets and/or time associated with the multimedia content.
According to one embodiment, operations performed by the network access device 101 may include, for example, at least one of anti-spyware, firewall, content filtering, intrusion detection and prevention, and/or IPSec VPNs (Virtual Private Networks), etc. functionality. Other components may be included.
In one embodiment, the connection between the network access device 101 and the network 109 may be a wired connection. Alternatively, such a connection may be a wireless connection; for example, a satellite or an IEEE 802.16 connection. The network 109 may be a wide area network (WAN), such as, for example, the Internet. The network 110 may be a home network using an IEEE 802.1x compatible protocol. Alternatively, the network 110 may be a local network within an organization (e.g., an Intranet). The network 110 may be a wired or wireless network, or a combination of both, using a variety of network protocols, such as, Ethernet and/or IEEE 802.xx compatible protocols, such as, for example, Wi-Fi and/or Bluetooth protocols. Wireless connections may include both RF and non-RF links, for example, an IR link. Wired connections may include both electrical and non-electrical links, for example, fiber optic links. The content rating engine (not shown) may include one or more engines (e.g., one for token-based rating and one for link-based rating, etc.) The content rating engine may be implemented as a part of network access device 101. Alternatively, the content rating engine may be implemented as a dedicated server within network 110, or a Web server coupled to the network 109.
Note that the first network 109 and the second network 110 may be the same network. For example, a computer node may play a movie from a network storage residing on the same network and still enjoy multimedia content filtering services if the network traffic traverses an appliance that implements the services. Other configurations may exist.
In one embodiment, process 200 includes, but is not limited to, in response to multimedia content transmitted from a source over a first network and destined to a destination over a second network, opening the multimedia content within a network access device interfacing the first and second networks, and performing content rating on the opened multimedia content to determine whether the multimedia content should be transmitted to the destination over the second network.
Referring to
Referring to
In response to the recognized patterns of the audio stream, classification module 304 may perform content classification on the recognized audio stream based on, for example, a vocabulary or semantics of the audio stream. In response to the classification, action module 305 may take appropriate actions on the audio stream based on one or more policies 306, which may be set up by an administrator or user. For example, certain offensive words may be removed or masked out in response to the detection and/or classification.
In one embodiment, the audio codec 302 may be pluggable to facilitate support of audio streams in various formats. In addition, a flexible interface with the speech recognition module 303 allows for plugging in proprietary or third party speech recognition engine, which may be implemented locally or remotely over a network. Further, various content categories are utilized (e.g., configurable within policies 306) to allow or to block user access to various types of content. Such content categories may be tailored around specific users, groups, IP subnets, time of day or week or other relevant policies. Other configurations may exist.
Access to the Internet 422 is typically provided by Internet service providers (ISPs), such as the ISP 424, and the ISP 426. Users on client systems, such as the client computer systems 402, 404, 418, and 420, generally obtain access to the Internet through Internet service providers, such as ISPs 424 and 426. Access to the Internet may facilitate transfer of information (e.g., email, text files, media files, etc.) between two or more digital processing systems, such as the client computer systems 402, 404, 418, and 420 and/or a Web server system 428.
For example, one or more of the client computer systems 402, 404, 418, and 420 and/or the Web server 428 may provide document presentations (e.g., a Web page) to another one or more of the client computer systems 402, 404, 418, and 420 and/or Web server 428. For example, in one embodiment of the invention, one or more client computer systems 402, 404, 418, and 420 may request to access a document that may be stored at a remote location, such as the Web server 428. In the case of remote storage, the data may be transferred as a file (e.g., download) and then displayed (e.g., in a window of a browser) after transferring the file. In another embodiment, the document presentation may be stored locally at the client computer systems 402, 404, 418, and/or 420. In the case of local storage, the client system may retrieve and display the document via an application, such as a word processing application, without requiring a network connection.
The Web server 428 typically includes at least one computer system to operate with one or more data communication protocols, such as the protocols of the World Wide Web and, as such, is typically coupled to the Internet 422. Optionally, the Web server 428 may be part of an ISP which may provide access to the Internet and/or other network(s) for client computer systems. The client computer systems 402, 404, 418, and 420 may each, with appropriate Web browsing software, access data, such as HTML document (e.g., Web pages), which may be provided by the Web server 428.
The ISP 424 provides Internet connectivity to the client computer system 402 via a modem interface 406, which may be considered as part of the client computer system 402. The client computer systems 402, 404, 418, and 420 may be a conventional data processing system, such as a Power Mac G5 or iMac computer available from Apple Computer, or an IBM compatible PC, a “network” computer, a handheld/portable computer, a cell phone with data processing capabilities, a Web TV system, or other types of digital processing systems (e.g., a personal digital assistant (PDA)).
Similarly, the ISP 426 provides Internet connectivity for the client computer systems 402, 404, 418, and 420. However, as depicted in
The client computer systems 418 and 420 are coupled to a LAN bus 412 through network interfaces 414 and 416, respectively. The network interface 414 and 416 may be an Ethernet-type, asynchronous transfer mode (ATM), or other type of network interface. The LAN bus is also coupled to a gateway digital processing system 410, which may provide firewall and other Internet-related services for a LAN. The gateway system 410, in turn, is coupled to the ISP 426 to provide Internet connectivity to the client computer systems 418 and 420. The gateway digital processing system 410 may, for example, include a conventional server computer system. Similarly, the Web server 428 may, for example, include a conventional server computer system.
In one embodiment, the local area network 412 may be local wireless network (e.g., a home network) and the gateway 410 may include a wireless access point (also referred to as a base station) to one or more clients 418 and 420 using a variety of wireless networking protocols; for example, the IEEE 802.xx protocols including Wi-Fi and/or Bluetooth protocols. In a further embodiment, the gateway 410 may access the server 428 via dialup network services using a modem.
According to one embodiment, the content filtering techniques on multimedia content described above, may be implemented with any of the network access devices, such as, modems 406 and 408, and/or gateway 410.
Note, that while
As shown in
Typically, the input/output devices 510 are coupled to the system through input/output controllers 509. The volatile RAM 505 is typically implemented as dynamic RAM (DRAM) which requires power continuously in order to refresh or maintain the data in the memory. The non-volatile memory 506 is typically a magnetic hard drive, a magnetic optical drive, an optical drive, or a DVD RAM or other type of memory system which maintains data even after power is removed from the system. Typically, the non-volatile memory will also be a random access memory, although this is not required.
While
Thus, method and apparatus for multimedia content filtering are described herein. Some portions of the preceding detailed descriptions have been presented in terms of algorithms and symbolic representations of operations on data bits within a computer memory. These algorithmic descriptions and representations are the ways used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. An algorithm is here, and generally, conceived to be a self-consistent sequence of operations leading to a desired result. The operations are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.
It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the above discussion, it is appreciated that throughout the description, discussions utilizing terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.
Embodiments of the present invention also relate to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), erasable programmable ROMs (EPROMs), electrically erasable programmable ROMs (EEPROMs), magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus.
The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Various general-purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method operations. The required structure for a variety of these systems will appear from the description below. In addition, embodiments of the present invention are not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of embodiments of the invention as described herein.
A machine-readable medium may include any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer). For example, a machine-readable medium includes read only memory (“ROM”); random access memory (“RAM”); magnetic disk storage media; optical storage media; flash memory devices; etc.
In the foregoing specification, embodiments of the invention have been described with reference to specific exemplary embodiments thereof. It will be evident that various modifications may be made thereto without departing from the broader spirit and scope of the invention as set forth in the following claims. The specification and drawings are, accordingly, to be regarded in an illustrative sense rather than a restrictive sense.
Claims
1. A method for multimedia content filtering, the method comprising:
- receiving at a network access device, a stream of multimedia content transmitted from a source over a first network and destined for a destination over a second network; and
- executing instructions stored in memory of the network access device, wherein execution of the instructions by a processor: extracts audio content from the multimedia content in real-time while the multimedia content is streaming, recognizes an audio pattern within the extracted audio content, determines that the stream of multimedia content should not be completed based on a policy associated with the recognized audio pattern, and terminates the stream of the multimedia content.
2. The method of claim 1, wherein the determination that the stream of media content should not be completed is performed on a packet level.
3. The method of claim 1, further comprising extracting other multi-media content in real-time for filtering.
4. The method of claim 3, wherein the other multi-media content includes images.
5. The method of claim 3, wherein the other multi-media content includes video content.
6. The method of claim 1, wherein the determination that the stream of media content should not be completed is based on a list of vocabulary or semantics found to be offensive to a user.
7. The method of claim 1, wherein the determination that the stream of media content should not be completed is customizable around one or more of the following: specific users, groups, IP subnets, or time.
8. A non-transitory machine readable storage medium having embodied thereon executable code that when executed causes a machine to perform a method for multimedia content filtering, the method comprising:
- receiving at a network access device, a stream of multimedia content transmitted from a source over a first network and destined for a destination over a second network;
- extracting audio content from the multimedia content in real-time while the multimedia content is streaming;
- recognizing an audio pattern within the extracted audio content, determining that the stream of multimedia content should not be completed based on a policy associated with the recognized audio pattern, and
- terminating the stream of the multimedia content.
9. The non-transitory machine readable storage medium of claim 8, wherein the determination that the stream of media content should not be completed is performed on a packet level.
10. The non-transitory machine readable storage medium of claim 8 further comprising extracting other multi-media content in real-time for filtering.
11. The non-transitory machine readable storage medium of claim 10, wherein the other multi-media content includes images.
12. The non-transitory machine readable storage medium of claim 10, wherein the other multi-media content includes video content.
13. The non-transitory machine readable storage medium of claim 8, wherein the determination that the stream of media content should not be completed is based on a list of vocabulary or semantics found to be offensive to a user.
14. The non-transitory machine readable storage medium of claim 8, wherein the determination that the stream of media content should not be completed is customizable around one or more of the following: specific users, groups, IP subnets, or time.
15. A system for multimedia content filtering, the system comprising:
- a network access device that receives a stream of multimedia content transmitted from a source over a first network and destined for a destination over a second network;
- an audio codec that extracts audio content from the multimedia content in real-time while the multimedia content is streaming;
- a speech recognition engine that recognizes an audio pattern within the extracted audio content; and
- a content filtering unit that: determines that the stream of multimedia content should not be completed based on a policy associated with the recognized audio pattern, and terminates the stream of the multi media content.
16. The system of claim 15, wherein the determination that the stream of media content should not be completed is performed on a packet level.
17. The system of claim 15, further comprising extracting other multi-media content in real-time for filtering.
18. The system of claim 17, wherein the other multi-media content includes images.
19. The system of claim 17, wherein the other multi-media content includes video content.
20. The system of claim 15, wherein the determination that the stream of media content should not be completed is based on a list of vocabulary or semantics found to be offensive to a user.
21. The system of claim 15, wherein the determination that the stream of media content should not be completed is customizable around one or more of the following: specific users, groups, IP subnets, or time.
Type: Application
Filed: Mar 1, 2016
Publication Date: Jun 23, 2016
Inventors: Roman Yanovsky (Los Altos, CA), Igor Korsunsky (Sunnyvale, CA), John Gmuender (San Jose, CA), Boris Yanovsky (Saratoga, CA)
Application Number: 15/057,747