APPARATUSES, SYSTEMS AND METHODS FOR INTEGRATING VEHICLE OPERATOR GESTURE DETECTION WITHIN GEOGRAPHIC MAPS
Apparatuses, systems and methods are provided for vehicle operator gesture recognition and transmission of related gesture data. More particularly, apparatuses, systems and methods are provided for vehicle operator gesture recognition and transmission of related gesture data to at least one geographic map programming interface.
This application is a continuation of U.S. patent application Ser. No. 15/875,634, filed Jan. 19, 2018, and entitled APPARATUSES, SYSTEMS AND METHODS FOR INTEGRATING VEHICLE OPERATOR GESTURE DETECTION WITHIN GEOGRAPHIC MAPS, which claims priority, under 35 U.S.C. § 119(b), to U.S. Provisional Patent Application Ser. No. 62/448,041, filed on Jan. 19, 2017, and entitled APPARATUSES, SYSTEMS AND METHODS FOR INTEGRATING VEHICLE OPERATOR GESTURE DETECTION WITHIN GEOGRAPHIC MAPS, the entire disclosures of each of which are incorporated herein by reference.
The present application is related to U.S. patent application Ser. No. 14/994,299, entitled APPARATUSES, SYSTEMS AND METHODS FOR ACQUIRING IMAGES OF OCCUPANTS INSIDE A VEHICLE, filed Jan. 13, 2016; Ser. No. 14/994,305, entitled APPARATUSES, SYSTEMS AND METHODS FOR CLASSIFYING DIGITAL IMAGES, filed Jan. 13, 2016; Ser. No. 14/994,308, entitled APPARATUSES, SYSTEMS AND METHODS FOR CLASSIFYING DIGITAL IMAGES, filed Jan. 13, 2016; Ser. No. 14/994,310, entitled APPARATUSES, SYSTEMS AND METHODS FOR COMPRESSING IMAGE DATA THAT IS REPRESENTATIVE OF A SERIES OF DIGITAL IMAGES, filed Jan. 13, 2016; Ser. No. 14/994,409, entitled APPARATUSES, SYSTEMS AND METHODS FOR DETERMINING DISTRACTIONS ASSOCIATED WITH VEHICLE DRIVING ROUTES, filed Jan. 13, 2016; Ser. No. 14/994,415, entitled APPARATUSES, SYSTEMS AND METHODS FOR GENERATING DATA REPRESENTATIVE OF VEHICLE DRIVER RATINGS, filed Jan. 13, 2016; Ser. No. 14/994,419, entitled APPARATUSES, SYSTEMS AND METHODS FOR GENERATING DATA REPRESENTATIVE OF VEHICLE OCCUPANT POSTURES, filed Jan. 13, 2016; Ser. No. 14/994,424, entitled APPARATUSES, SYSTEMS AND METHODS FOR TRANSITIONING BETWEEN AUTONOMOUS AND MANUAL MODES OF VEHICLE OPERATION, filed Jan. 13, 2016; Ser. No. 14/994,431, entitled APPARATUSES, SYSTEMS AND METHODS FOR DETERMINING WHETHER A VEHICLE IS BEING OPERATED IN AUTONOMOUS MODE OR MANUAL MODE, filed Jan. 13, 2016; Ser. No. 14/994,436, entitled APPARATUSES, SYSTEMS AND METHODS FOR DETERMINING VEHICLE OPERATOR DISTRACTIONS, filed Jan. 13, 2016; Ser. No. 14/994,440, entitled APPARATUSES, SYSTEMS AND METHODS FOR DETERMINING WHETHER A VEHICLE SYSTEM IS DISTRACTING TO A VEHICLE OPERATOR, filed Jan. 13, 2016; Ser. No. 14/862,949, entitled SYSTEMS AND METHODS FOR USING IMAGE DATA TO GENERATE VEHICLE OPERATION LOGS, filed Sep. 23, 2015; and Ser. No. 14/989,524, entitled SYSTEMS AND METHODS FOR ASSOCIATING VEHICLE OPERATORS WITH DRIVING MISSES INDICATED IN VEHICLE OPERATION DATA, filed Jan. 6, 2016; the disclosures of which are incorporated herein in their entireties by reference thereto.
TECHNICAL FIELD
The present disclosure is directed to apparatuses, systems and methods for detecting vehicle operator gestures and transmission of related gesture data. More particularly, the present disclosure is directed to apparatuses, systems and methods for integrating vehicle operator gesture recognition data within geographic maps.
BACKGROUND
Vehicles are being provided with more complex systems. For example, vehicles commonly include a plethora of entertainment systems, such as stereos, USB interfaces for mobile telephones, video players, etc. Vehicles often have a host of other operator interfaces, such as emergency calling systems, vehicle navigation systems, heating and air conditioning systems, interior and exterior lighting controls, air bags, seatbelts, etc.
Vehicle operating environments are becoming more complex as well. For example, some roadways include U-turn lanes, roundabouts, no-left-turn restrictions, reversible lanes that run one way in the morning and the other way in the afternoon, etc. Increases in traffic are also contributing to increased complexity.
These additional complexities contribute to increases in driver risk. What is needed are methods and systems for generating data representative of vehicle in-cabin insurance risk evaluations based on data representative of skeletal diagrams of a driver that are indicative of degrees of driver risk.
SUMMARY
A device for determining vehicle operator gestures and incorporating the vehicle operator gestures within geographic maps may include a previously classified image data receiving module stored on a memory that, when executed by a processor, causes the processor to receive previously classified image data from at least one previously classified image database. The previously classified image data may be representative of previously classified vehicle occupant gestures. The device may also include a current image data receiving module stored on a memory that, when executed by a processor, causes the processor to receive current image data from at least one vehicle interior sensor. The current image data may be representative of current vehicle occupant gestures. The device may further include a current image data classification module stored on a memory that, when executed by a processor, causes the processor to classify the current image data by comparing the current image data to the previously classified image data. The currently classified image data may be representative of at least one current vehicle occupant gesture. The device may yet further include a currently classified image data transmission module stored on a memory that, when executed by a processor, causes the processor to transmit the currently classified image data to at least one geographic map application programming interface.
In another embodiment, a computer-implemented method for determining vehicle occupant gestures and for incorporating the vehicle occupant gestures within geographic maps may include receiving, at a processor of a computing device, previously classified image data from at least one previously classified image database in response to the processor executing a previously classified image data receiving module. The previously classified image data is representative of previously classified vehicle occupant gestures. The method may also include receiving, at a processor of a computing device, current image data from at least one vehicle interior sensor, in response to the processor executing a current image data receiving module. The current image data may be representative of at least one current vehicle occupant gesture. The method may further include classifying, using a processor of a computing device, at least one gesture associated with a vehicle occupant, based on a comparison of the current image data with the previously classified image data, in response to the processor executing a current image data classification module. The method may yet further include transmitting, using a processor of a computing device, the currently classified image data to at least one geographic map application programming interface, in response to the processor executing a currently classified image data transmission module.
In a further embodiment, a non-transitory computer-readable medium may store computer-readable instructions that, when executed by a processor, cause the processor to determine vehicle occupant gestures and incorporate the vehicle occupant gestures within geographic maps. The non-transitory computer-readable medium may include a previously classified image data receiving module that, when executed by a processor, may cause the processor to receive previously classified image data from at least one previously classified image database. The previously classified image data may be representative of previously classified vehicle occupant gestures. The non-transitory computer-readable medium may also include a current image data receiving module that, when executed by a processor, may cause the processor to receive current image data from at least one vehicle interior sensor. The current image data may be representative of current vehicle occupant gestures. The non-transitory computer-readable medium may further include a current image data classification module that, when executed by a processor, may cause the processor to classify the current image data by comparing the current image data to the previously classified image data. The currently classified image data may be representative of at least one current vehicle occupant gesture. The non-transitory computer-readable medium may yet further include a currently classified image data transmission module that, when executed by a processor, may cause the processor to transmit the currently classified image data to at least one geographic map application programming interface.
Apparatuses, systems and methods for integrating vehicle operator gesture detection within geographic maps are provided. For example, patterns in vehicle occupant gestures in aggregate may be detected (e.g., detecting a lot of head turns to one side). A hazard type (e.g., an accident, a traffic jam, a road closure, road construction, etc.) may be determined based on the patterns in vehicle occupant gestures. A location of the hazard may be determined based on geographic location data. Data related to the hazard type and/or the hazard location may be automatically transmitted to, for example, a geographic map application programming interface (API) (e.g., a Waze API, a BING API, a GOOGLE maps API, etc.). Thereby, a hazard type and/or hazard location may be incorporated within realtime geographic maps without manually entering any data.
The apparatuses, systems and methods described herein are directed to an improvement to computer functionality, and improve the functioning of conventional computers. For example, generation of data representative of degrees of vehicle operator risks may include the following capabilities: 1) determine whether a vehicle driver is looking at a road (i.e., tracking the driver's face/eyes, with emphasis on differentiating between similar actions, such as a driver who is adjusting a radio while looking at the road versus adjusting the radio while not looking at the road at all); 2) determine whether a driver's hands are empty (e.g., including determining an approximate size/shape of an object in a driver's hands to, for example, differentiate between a cell phone and a large cup); 3) identify a finite number of driver postures; and 4) log rotated and scaled postures that are normalized for a range of different drivers.
An associated mobile application may accommodate all popular platforms, such as iOS, Android and Windows, to connect an onboard device to a cell phone. In addition to functioning as a data connection provider to remote servers, the mobile application may provide a user friendly interface for reporting and troubleshooting. Accordingly, associated memory, processing, and related data transmission requirements may be reduced compared to previous approaches.
For clarity, only one vehicle in-cabin device 205 is depicted in
The vehicle in-cabin device 205 may also include a compass sensor 227, a global positioning system (GPS) sensor 229, and a battery 223. The vehicle in-cabin device 205 may further include an image sensor input 235 communicatively connected to, for example, a first image sensor 236 and a second image sensor 237. While two image sensors 236, 237 are depicted in
As one example, a first image sensor 236 may be located in a driver-side A-pillar, a second image sensor 237 may be located in a passenger-side A-pillar, a first infrared sensor 241 may be located in a driver-side B-pillar, a second infrared sensor 242 may be located in a passenger-side B-pillar, first and second ultrasonic sensors 246, 247 may be located in a center portion of a vehicle dash, and first and second microphones 251, 252 may be located on a bottom portion of a vehicle interior rearview mirror. The processor 215 may acquire position data from any one of, or all of, these sensors 236, 237, 241, 242, 246, 247, 251, 252 and generate at least one 3D model (e.g., a 3D model of at least a portion of a vehicle driver) based on the position data. The processor 215 may transmit data representative of at least one 3D model to the remote computing device 210. Alternatively, the processor 215 may transmit the position data to the remote computing device 210 and the processor 255 may generate at least one 3D model based on the position data. In either event, the processor 215 or the processor 255 may retrieve data representative of a 3D model of a vehicle operator and compare the data representative of the 3D model of at least a portion of the vehicle driver with data representative of at least a portion of the 3D model of the vehicle operator. The processor 215 and/or the processor 255 may generate a vehicle driver warning based on the comparison of the data representative of the 3D model of at least a portion of the vehicle driver with data representative of at least a portion of the 3D model of the vehicle operator to warn the vehicle operator that the operator's position is indicative of inattentiveness. Alternatively, the processor 215 and/or the processor 255 may generate an advisory based on the comparison of the data representative of the 3D model of at least a portion of the vehicle driver with data representative of at least a portion of the 3D model of a vehicle operator to advise the vehicle operator how to correct the operator's position to improve attentiveness.
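As a non-limiting illustration, the comparison of a current 3D model with a stored 3D model of the vehicle operator may be sketched as follows. The joint names, the mean-distance measure, and the 0.15 meter threshold are assumptions made for this sketch only and are not specified by the present disclosure.

```python
import math

def posture_deviation(current, reference):
    """Mean Euclidean distance between joints present in both 3D models.

    Each model is a dict mapping a (hypothetical) joint name to an (x, y, z)
    position in meters.
    """
    shared = current.keys() & reference.keys()
    if not shared:
        raise ValueError("models share no joints")
    return sum(math.dist(current[j], reference[j]) for j in shared) / len(shared)

def driver_warning(current, reference, max_deviation_m=0.15):
    """Return a warning string when the deviation exceeds the assumed threshold."""
    deviation = posture_deviation(current, reference)
    if deviation > max_deviation_m:
        return f"Position deviates {deviation:.2f} m from reference; possible inattentiveness"
    return None
```

In such a sketch, an advisory (rather than a warning) could be derived from the per-joint differences instead of the mean.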
The network interface 230 may be configured to facilitate communications between the vehicle in-cabin device 205 and the remote computing device 210 via any hardwired or wireless communication network 215, including for example a wireless LAN, MAN or WAN, WiFi, the Internet, or any combination thereof. Moreover, the vehicle in-cabin device 205 may be communicatively connected to the remote computing device 210 via any suitable communication system, such as via any publicly available or privately owned communication network, including those that use wireless communication structures, such as wireless communication networks, including for example, wireless LANs and WANs, satellite and cellular telephone communication systems, etc. The vehicle in-cabin device 205 may cause insurance risk related data to be stored in a remote computing device 210 memory 260 and/or a remote insurance related database 270.
The remote computing device 210 may include a memory 260 and a processor 255 for storing and executing, respectively, a module 261. The module 261, stored in the memory 260 as a set of computer-readable instructions, facilitates applications related to determining a vehicle in-cabin device location and/or collecting insurance risk related data. The module 261 may also facilitate communications between the computing device 210 and the vehicle in-cabin device 205 via a network interface 265, a remote computing device network connection 266 and the network 215 and other functions and instructions.
The computing device 210 may be communicatively coupled to an insurance related database 270. While the insurance related database 270 is shown in
Apparatuses, systems and methods that integrate vehicle operator gesture detection within geographic maps are provided. For example, a vehicle device (e.g., vehicle device 305b) may detect patterns in vehicle occupant gestures in aggregate (e.g., detecting a lot of head turns to one side) based on, for example, comparing current image data with previously classified image data. Furthermore, the vehicle device 305b may determine a hazard type (e.g., an accident, a traffic jam, a road closure, road construction, etc.) based on the patterns in vehicle occupant gestures. A location of the hazard may be determined based on geographic location data (e.g., vehicle location data). The vehicle device 305b may automatically transmit data related to the hazard type and/or the hazard location to, for example, a geographic map application programming interface (API) (e.g., a Waze API, a BING API, a GOOGLE maps API, etc.). Thereby, a hazard type and/or hazard location may be incorporated within real-time geographic maps without manually entering any data.
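As a non-limiting illustration, detecting an aggregate pattern of head turns and preparing hazard data for a map API may be sketched as follows. The observation schema, the gesture labels, the thresholds, the generic hazard type, and the JSON payload layout are all hypothetical; the sketch does not call, and does not reflect the interface of, any actual map API.

```python
import json
from collections import Counter

HEAD_TURNS = ("head_turn_left", "head_turn_right")  # hypothetical gesture labels

def detect_hazard(observations, min_reports=3, min_share=0.6):
    """Return a hazard report dict, or None if no aggregate pattern is found.

    observations: classified gestures from many vehicles near one location,
    each a dict with "gesture", "lat" and "lon" keys (hypothetical schema).
    """
    if not observations:
        return None
    counts = Counter(o["gesture"] for o in observations if o["gesture"] in HEAD_TURNS)
    if not counts:
        return None
    gesture, reports = counts.most_common(1)[0]
    # Require both an absolute count and a share of all observations.
    if reports < min_reports or reports / len(observations) < min_share:
        return None
    matching = [o for o in observations if o["gesture"] == gesture]
    return {
        "hazard_type": "unspecified_roadside_hazard",  # placeholder hazard type
        "side": gesture.rsplit("_", 1)[1],
        "lat": sum(o["lat"] for o in matching) / reports,
        "lon": sum(o["lon"] for o in matching) / reports,
        "reports": reports,
    }

def to_map_api_payload(hazard):
    """Serialize the hazard report; the transport to a map API is out of scope."""
    return json.dumps(hazard, sort_keys=True)
```

Here the hazard location is approximated as the mean location of the vehicles reporting the dominant gesture, which is one of several possible choices.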
Vehicle driver postures may be rotated and scaled to be standardized (or normalized) for vehicle device 205, 300a, 300b locations within a vehicle and standardized (or normalized) to an average human (i.e., applicable to all drivers). Subsequent to being registered within a given vehicle, a vehicle device 205, 300a, 300b may use image sensors 265, 270 to detect driver movements and record/categorize distinct driver postures (e.g., skeletal diagrams 125, 150, 155, 160, 165, 170). The methods and systems of the present disclosure may present results in two ways: 1) via a detailed report of different postures; and 2) via a graphical representation of the postures detected with timeframe (e.g., as in report 100 of
The processor 225 may execute a previously classified image data receiving module 320b to cause the processor 225 to, for example, receive previously classified image data (block 510b). The previously classified image data may be, for example, representative of images and/or extracted image features that have been previously classified as being indicative of degrees of vehicle operator risk. More particularly, the previously classified image data may include images and/or extracted image features that have previously been classified as being representative of a vehicle operator using a cellular telephone, a vehicle occupant looking out a vehicle side window, a vehicle occupant adjusting a vehicle radio, a vehicle occupant adjusting a vehicle heating, ventilation and air conditioning system, two vehicle occupants talking with one-another, a vehicle occupant reading a book or magazine, a vehicle occupant putting on makeup, a vehicle occupant looking at themselves in a mirror, etc. Alternatively, or additionally, the previously classified image data may, for example, be representative of known vehicle occupant locations/orientations, known cellular telephone locations/orientations, known vehicle occupant eye locations/orientations, known vehicle occupant head location/orientation, known vehicle occupant hand location/orientation, a known vehicle occupant torso location/orientation, a known seat belt location, a known vehicle seat location/orientation, etc.
The processor 225 may execute a current image data receiving module 325b to cause the processor 225 to, for example, receive current image data (block 515b). For example, the processor 225 may receive current image data from at least one vehicle sensor (e.g., at least one of a compass sensor 327, a GPS sensor 329, an image sensor 336, 337, an infrared sensor 341, 342, an ultrasonic sensor 346, 347, and/or a microphone 351, 352). The current image data may include images and/or extracted image features that are representative of a vehicle occupant using a cellular telephone, a vehicle occupant looking out a vehicle side window, a vehicle occupant adjusting a vehicle radio, a vehicle occupant adjusting a vehicle heating, ventilation and air conditioning system, two vehicle occupants talking with one-another, a vehicle occupant reading a book or magazine, a vehicle occupant putting on makeup, a vehicle occupant looking at themselves in a mirror, etc. Alternatively, or additionally, the current image data may, for example, be representative of vehicle occupant locations/orientations, cellular telephone locations/orientations, vehicle occupant eye locations/orientations, vehicle occupant head location/orientation, vehicle occupant hand location/orientation, a vehicle occupant torso location/orientation, a seat belt location, a vehicle seat location/orientation, etc.
The processor 225 may execute a current image data classification module 330b to, for example, cause the processor 225 to classify the current image data (block 520b). For example, the processor 225 may classify the current image data by comparing the current image data to previously classified image data. Alternatively, the processor 225 may extract features from the current image data, and may compare the features that are extracted from the current image data to features extracted from previously classified image data. The classified image data may be representative of, for example, distractions associated with a driving route, a vehicle driver rating, a vehicle occupant posture, transitioning between autonomous and manual modes of vehicle operation, whether a vehicle is being operated in an autonomous mode or a manual mode, vehicle operator distractions, whether a vehicle system is distracting to a vehicle operator, a degree of risk associated with a vehicle operator, various actions of a vehicle operator, a vehicle operation log, whether a vehicle is being driven into a rising or setting sun, vehicle operator near collision misses, whether a vehicle operator is texting while driving, whether a vehicle operator is using a mobile telephone while driving, a driving environment of a vehicle operator based on vehicle occupant actions, vehicle operator distractions at particular geographic locations, instances in which a vehicle operator is unfit to operate a vehicle, vehicle occupant actions, a notification that a vehicle operator is unfit to operate an associated vehicle, a vehicle operator's emotions, etc., as described in detail in, for example, the above U.S. patent applications that have been incorporated by reference.
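As a non-limiting illustration, classifying current image data by comparing extracted features to features of previously classified image data may be sketched as a nearest-neighbor lookup. The two-element feature vectors, the labels, and the choice of Euclidean distance are assumptions made for this sketch; the present disclosure does not prescribe a particular comparison technique.

```python
import math

# Hypothetical features extracted from previously classified image data,
# keyed by the gesture label assigned during prior classification.
PREVIOUSLY_CLASSIFIED = {
    "looking_at_road": [(0.0, 0.1), (0.1, 0.0)],
    "looking_out_side_window": [(0.9, 0.1), (1.0, 0.0)],
    "using_cellular_telephone": [(0.1, 0.9), (0.0, 1.0)],
}

def classify(features, reference=PREVIOUSLY_CLASSIFIED):
    """Return (label, distance) of the closest previously classified example."""
    best_label, best_distance = None, math.inf
    for label, examples in reference.items():
        for example in examples:
            distance = math.dist(features, example)
            if distance < best_distance:
                best_label, best_distance = label, distance
    return best_label, best_distance
```

The returned distance may serve as a rough indication of match quality, for example when deciding whether current image data resembles any known behavior at all.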
The processor 225 may execute a vehicle location data, current image data and/or classified image data transmission module 335b to, for example, cause the processor 225 to transmit vehicle location data, current image data and/or classified image data to at least one individual (e.g., a vehicle operator, a pedestrian, a bicyclist, a law enforcement officer, etc.) (block 525). The processor 225 may execute a vehicle location data, current image data and/or classified image data transmission module 340b to, for example, cause the processor 225 to transmit vehicle location data, current image data and/or classified image data to at least one other vehicle (block 530). The processor 225 may execute a vehicle location data, current image data and/or classified image data transmission module 345b to, for example, cause the processor 225 to transmit vehicle location data, current image data and/or classified image data to common infrastructure (e.g., roadside equipment (RSE), a remote server, a law enforcement server, an insurance company server, etc.) (block 535).
The processor 225 may execute a vehicle location data, current image data and/or classified image data receiving module 350b to, for example, cause the processor 225 to receive vehicle location data, current image data and/or classified image data from at least one individual (block 540). The processor 225 may execute a vehicle location data, current image data and/or classified image data receiving module 340b to, for example, cause the processor 225 to receive vehicle location data, current image data and/or classified image data from at least one other vehicle (block 545). The processor 225 may execute a vehicle location data, current image data and/or classified image data receiving module 345b to, for example, cause the processor 225 to receive vehicle location data, current image data and/or classified image data from common infrastructure (e.g., roadside equipment (RSE), a remote server, a law enforcement server, etc.) (block 550).
Collective processors may determine that a risk of a crash has exceeded an acceptable threshold (or that a crash is imminent). A probability of a crash occurring may be determined by analyzing a current state of one or more drivers, dynamics and/or trajectories of one or more vehicles, locations of one or more RSEs, locations of one or more infrastructure fixtures, and/or locations and/or trajectories of one or more road users (e.g., pedestrians). A magnitude of a probable crash may be determined by analyzing a current state of one or more drivers, dynamics and/or trajectories of one or more vehicles, speeds of one or more vehicles, acceleration (or deceleration) of one or more vehicles, locations of one or more RSEs, locations of one or more infrastructure fixtures, and/or locations and trajectories of one or more road users (e.g., pedestrians). A warning may be transmitted to one or more drivers or one or more road users in order to prevent the crash.
Collective processors may determine that there are multiple vehicles with risk scores that exceed a given threshold and/or that there are multiple vehicles with driver states that are classified as potentially hazardous. The collective processors may determine that the vehicles with risk scores that exceed the threshold are within a given proximity of each other (and at least one RSE). A group of distracted vehicles may be determined to be a cluster. The cluster may consist of a changing number of vehicles, depending on the risk scores and proximities in real time. For example, where a vehicle in a distracted cluster is Vehicle A, the network may analyze locations and headings of other vehicles near the cluster of distracted vehicles. If a nearby vehicle (e.g., Vehicle B) is on a trajectory, or a route, toward the cluster, a network onboard the approaching vehicle may reroute or redirect the driver of Vehicle B (e.g., by finding alternative routes, pathways, turns, etc., away from the distracted cluster). Alternatively, or additionally, the driver of Vehicle B may elect to set this re-routing to automatic or selection-only, or to deactivate the re-routing completely.
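As a non-limiting illustration, grouping vehicles whose risk scores exceed a threshold into proximity-based clusters may be sketched as follows. Planar coordinates in meters, the 0.7 risk threshold, the 100 meter radius, and the single-linkage grouping are assumptions made for this sketch; proximity to an RSE is not modeled.

```python
import math

def distracted_clusters(vehicles, risk_threshold=0.7, radius_m=100.0):
    """Group risky vehicles that are chained together by proximity.

    vehicles: dict mapping a vehicle id to (x_m, y_m, risk_score).
    Returns a list of sets of vehicle ids; lone risky vehicles are omitted.
    """
    risky = {vid: (x, y) for vid, (x, y, risk) in vehicles.items()
             if risk > risk_threshold}
    unvisited = set(risky)
    clusters = []
    while unvisited:
        seed = unvisited.pop()
        cluster, frontier = {seed}, [seed]
        while frontier:
            current = frontier.pop()
            near = {v for v in unvisited
                    if math.dist(risky[current], risky[v]) <= radius_m}
            unvisited -= near
            cluster |= near
            frontier.extend(near)
        if len(cluster) >= 2:
            clusters.append(cluster)
    return clusters
```

Because the function is recomputed from current positions and scores, re-running it on each update yields clusters whose membership changes over time, consistent with the description above.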
Warnings to avoid distracted clusters may be transmitted to vulnerable (non-vehicular) road users, such as bicyclists, pedestrians, and occupants of cars broken down on the side of the road. A system may detect driver movements within a vehicle (e.g., driver head pose, hand motions, body posture, etc.).
A degree of driver risk may be determined using, for example, a probability function where each term may be a weighted factor derived from image data, and may include images and/or extracted image features that are representative of a vehicle operator using a cellular telephone, a vehicle occupant looking out a vehicle side window, a vehicle occupant adjusting a vehicle radio, a vehicle occupant adjusting a vehicle heating, ventilation and air conditioning system, two vehicle occupants talking with one-another, a vehicle occupant reading a book or magazine, a vehicle occupant putting on makeup, a vehicle occupant looking at themselves in a mirror, vehicle occupant locations/orientations, cellular telephone locations/orientations, vehicle occupant eye locations/orientations, vehicle occupant head location/orientation, vehicle occupant hand location/orientation, a vehicle occupant torso location/orientation, a seat belt location, a vehicle seat location/orientation, etc. As a specific example, if the current image data is representative of a vehicle operator using a cellular phone and looking out a side window, the resulting risk will be higher than when the current image data is representative of the vehicle operator only looking out the side window, and not using a cellular phone. Any given vehicle operator activity may be weighted individually based upon, for example, a likelihood that the particular vehicle operator activity would cause property and/or personal damage.
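As a non-limiting illustration, one way to combine individually weighted activities into a single degree of risk is a "noisy-OR" combination, in which each detected activity independently contributes its weight as a probability. The activity labels, the weights, and the noisy-OR form itself are assumptions made for this sketch; the present disclosure does not specify a particular probability function.

```python
# Hypothetical per-activity weights, each read as the likelihood that the
# activity alone would lead to property and/or personal damage.
ACTIVITY_WEIGHTS = {
    "using_cellular_telephone": 0.5,
    "looking_out_side_window": 0.3,
    "adjusting_radio": 0.1,
}

def degree_of_risk(detected_activities, weights=ACTIVITY_WEIGHTS):
    """Combine weights as 1 - product(1 - w_i); unknown activities add no risk."""
    probability_safe = 1.0
    for activity in detected_activities:
        probability_safe *= 1.0 - weights.get(activity, 0.0)
    return 1.0 - probability_safe
```

With these assumed weights, using a cellular phone while looking out a side window yields a higher degree of risk than only looking out the side window, and the result always remains between zero and one.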
Systems and methods of the present disclosure may include detecting, transmitting, and categorizing in aggregate. While previously classified image data, current image data and/or currently classified image data may be transmitted for a particular individual, the data may be aggregated anonymously for a group of individuals. A geographic application programming interface (API) may perform data aggregation. Previously classified image data, current image data and/or currently classified image data may be stored in a central data repository individually for particular individuals and/or in aggregate based on certain characteristics (e.g., geographic location, time of day, day of year, etc.).
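As a non-limiting illustration, anonymous aggregation of classified gesture data by geographic location and time of day may be sketched as follows. The record fields (including the "operator_id" field that is deliberately dropped), the 0.01 degree grid cell, and the hourly bucketing are assumptions made for this sketch.

```python
from collections import Counter

def aggregate_anonymously(records, cell_deg=0.01):
    """Count gestures per (lat cell, lon cell, hour, gesture).

    records: iterable of dicts with "lat", "lon", "hour" and "gesture" keys.
    Any identifying fields (e.g., "operator_id") are ignored, so the result
    contains no per-individual information.
    """
    counts = Counter()
    for record in records:
        key = (
            round(record["lat"] / cell_deg),
            round(record["lon"] / cell_deg),
            record["hour"],
            record["gesture"],
        )
        counts[key] += 1
    return counts
```

Individual records may still be stored separately for particular individuals; this sketch covers only the aggregate view.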
There may be times when the system encounters previously-unclassified behaviors. For example, a device may detect driver movements from the current image data. The device may attempt to classify the current image data by comparison with previously-classified image data. Based on the uniqueness of the current image data, the device may determine that the probability of a match to a known behavior is below an acceptable threshold. The system onboard the individual device may create a sample of the 3D or 2D image data and store the sample on the device storage medium. When the behavior logs are uploaded to an external server, the sample image data of the unique behavior may be uploaded. Thereby, at a central data repository, a sample of a unique behavior may be collected along with other samples of unique behaviors (sent from other individual systems). From the collection of samples, pattern recognition algorithms may be applied in order to categorize the previously-uncategorized behaviors. As new categories are developed, these new classifications may be sent to update other devices in the field so that their classification systems may be even more robust for all the possible behaviors that may occur.
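As a non-limiting illustration, the handling of a previously-unclassified behavior on an individual device may be sketched as follows. The function name, the dictionary of match probabilities, the 0.6 threshold, and the in-memory list standing in for the device storage medium are assumptions made for this sketch.

```python
def classify_or_sample(match_probabilities, sample, pending_samples,
                       min_probability=0.6):
    """Return the best-matching known behavior, or None if no match is acceptable.

    match_probabilities: dict mapping a known behavior label to the probability
    that the current image data matches it.
    sample: the 2D or 3D image data sample to retain when no match is found.
    pending_samples: list standing in for on-device storage awaiting upload.
    """
    label, probability = max(
        match_probabilities.items(),
        key=lambda item: item[1],
        default=(None, 0.0),
    )
    if probability < min_probability:
        pending_samples.append(sample)  # kept for upload with the behavior logs
        return None
    return label
```

Samples accumulated in this way could later be uploaded and categorized centrally, with new categories distributed back to devices.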
A determination of a degree of risk may include a comparison of data associated with a current image with previously-classified image data. Additional factors may be taken into account in determining a degree of risk. For example, a risk of a specific behavior (e.g., texting) may evolve from identifying that the driver is currently texting and correlating that to known images of texting. Once it's known that the driver is texting, there may be different risks associated with different manifestations of that broad category of behavior.
A degree of risk for a category of behavior may depend on context. For example, the determination may include other sensor inputs, aside from image data, and processing of the information from at least one of those inputs (if available). However, additional inputs are not required in order to make a degree of risk determination.
Contextual factors that may form part of a degree of risk determination may include: 1) a current behavior in context of current and previous vehicle dynamics from a particular trip (e.g., variables such as lateral/longitudinal acceleration (acceleration, braking, cornering), speed, steering inputs, smooth or erratic trajectory, different risks associated with texting while in a middle of a left turn, etc.); 2) current behavior in context of current and previous locations from the particular trip; 3) current behavior in context of presence, relative distances and bearings of pedestrians near the vehicle as noted by vehicle sensors, or communications with an external database; 4) current behavior in context of previous behaviors from the particular trip (e.g., a risk of typing on a phone at 12:05, given that the driver also typed on the phone at 12:01 and 12:03, may be compared to a risk of typing only at 12:01; the former may indicate a conversation, while the latter may indicate a one-time message); 5) current behavior in context of known weather, road surface, or traffic conditions, where this information may be provided from sensors on an associated vehicle device, vehicle sensors and/or communications from external databases; and 6) current behavior in context of roadway infrastructure type. A roadway infrastructure type may be, for example, measured directly or deduced from location and vehicle dynamics. For example, a risk of texting while on a straight part of highway may be different from a risk of texting while merging on a highway on-ramp. Consideration of current behaviors in context of the factors noted above may be how the system reaches a determination of a degree of risk for the current behavior. This may start with a broad category of behavior (e.g., texting) and make a more precise risk determination based on contextual factors.
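As a non-limiting illustration, refining a broad-category risk with contextual factors may be sketched as a base risk scaled by context multipliers. The base risks, the roadway multipliers, the per-repeat increment, and the pedestrian multiplier are hypothetical values chosen only to make the sketch concrete; only three of the contextual factors described above are modeled.

```python
from dataclasses import dataclass

@dataclass
class Context:
    roadway: str = "straight_highway"      # roadway infrastructure type
    recent_same_behavior_count: int = 0    # prior occurrences during this trip
    pedestrians_nearby: bool = False

BASE_RISK = {"texting": 0.5}               # hypothetical broad-category risk
ROADWAY_MULTIPLIER = {                     # hypothetical context multipliers
    "straight_highway": 1.0,
    "on_ramp_merge": 1.5,
    "left_turn": 1.6,
}

def contextual_risk(behavior, context):
    """Start from the broad category, then refine using available context."""
    risk = BASE_RISK[behavior]
    risk *= ROADWAY_MULTIPLIER.get(context.roadway, 1.0)
    risk *= 1.0 + 0.1 * context.recent_same_behavior_count
    if context.pedestrians_nearby:
        risk *= 1.2
    return min(risk, 1.0)                  # keep the result within [0, 1]
```

When no additional inputs are available, a default `Context()` leaves the broad-category risk unchanged, consistent with additional inputs not being required.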
BR1 and TR1.1, 1.2 and 1.3 may be used to identify a new driver (e.g., an algorithm for recognizing the driver being a new driver). The system may use the detailed algorithm mentioned as described in
An AppComponents::iDataManipulation component 1525 may receive, as input, business objects acquired from or required by various business methods in other components. Output/Service may be provided as business objects extracted from a database via data access objects (DAOs) and methods. Depending on which component is calling, this component may have generic and client-specific APIs for serving various business objects. Component/Entity process may include: a data connection; a connection pool; and DAOs for the following entities: Driver, Snapshot Object, RideDetails and PosturesDetails. Constraints may include an initial connection pool size of ten and a maximum size of thirty.
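The connection pool constraint (ten connections initially, thirty at most) can be illustrated with a short sketch. The disclosure does not name a database engine; SQLite is assumed here purely so the example is self-contained, and the `ConnectionPool` class is a hypothetical illustration rather than the component's actual interface.

```python
import queue
import sqlite3


class ConnectionPool:
    """Hypothetical pool: opens `initial_size` connections, grows up to `max_size`."""

    def __init__(self, database=":memory:", initial_size=10, max_size=30):
        if not 0 < initial_size <= max_size:
            raise ValueError("require 0 < initial_size <= max_size")
        self._database = database
        self._max_size = max_size
        self._idle = queue.LifoQueue()
        self._created = 0
        for _ in range(initial_size):
            self._idle.put(self._new_connection())

    def _new_connection(self):
        self._created += 1
        return sqlite3.connect(self._database, check_same_thread=False)

    @property
    def created(self):
        """Total number of connections opened so far."""
        return self._created

    def acquire(self):
        """Return an idle connection, opening a new one only while under the cap."""
        try:
            return self._idle.get_nowait()
        except queue.Empty:
            if self._created >= self._max_size:
                raise RuntimeError("connection pool exhausted")
            return self._new_connection()

    def release(self, conn):
        """Hand a connection back to the pool for reuse."""
        self._idle.put(conn)
```

For brevity the sketch does not guard the connection counter with a lock, so a multi-threaded caller would need to add one.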
An AppComponents::iReadDataStream component 1535 may include input for an event to start and stop reading a video and sensor data stream from hardware. SDK APIs may be used for reading skeleton, face and hand tracking data. Output/Service may be provided via snapshot objects, and relevant joint coordinates may be output and stored in the database using the iDataManipulation component 1525. Live data may be transported to the iReportGenerator component 1520. Component/Entity process may work as a batch process to start and stop logging the read data in the database when triggered. The component may also transmit live data to the iReportGenerator component 1520 for display on screen. Constraints may include appropriate buffering and error handling, so that appropriate error messages are displayed/captured for downstream components.
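The start/stop logging behavior, live forwarding, buffering and error capture described above might be organized as in the sketch below. The `StreamLogger` class, the frame layout (a dict with a `"joints"` key) and the buffer size are assumptions made for illustration; the actual SDK calls are not specified in the disclosure and are not modeled here.

```python
from collections import deque


class StreamLogger:
    """Hypothetical reader: forwards live frames and buffers them while logging."""

    def __init__(self, buffer_size=256):
        # Bounded buffer: when full, the oldest frame is dropped.
        self._buffer = deque(maxlen=buffer_size)
        self._logging = False
        self.errors = []  # captured error messages for downstream components

    def start(self):
        """Handle the 'start' event: begin buffering frames for the database."""
        self._logging = True

    def stop(self):
        """Handle the 'stop' event: return buffered frames for a batch insert."""
        self._logging = False
        flushed = list(self._buffer)
        self._buffer.clear()
        return flushed

    def on_frame(self, frame, live_sink=None):
        """Validate one frame, forward it live, and buffer it if logging."""
        if not isinstance(frame, dict) or "joints" not in frame:
            self.errors.append(f"malformed frame: {frame!r}")
            return
        if live_sink is not None:
            live_sink(frame)  # e.g. a report generator display callback
        if self._logging:
            self._buffer.append(frame)
```

The list returned by `stop()` stands in for the batch that would be written to the database through the data manipulation component.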
An AppComponents::iClusterData component 1530 may take as input snapshot data read from the iReadDataStream component 1535 and from a database. Output/Service may include assigning a postureID to a snapshot and updating the posture database. Component/Entity process may include: retrieving snapshot and posture information from the database; matching snapshots with postures; inserting new snapshot/posture information into the database; and implementations of unsupervised clustering algorithms. Constraints may include a limit on the number of clusters generated.
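One simple way to match snapshots with postures under a cluster limit is an online nearest-centroid ("leader") scheme, sketched below. The disclosure does not say which unsupervised clustering algorithm is used, so this technique is a stand-in, and `assign_posture_id`, `MAX_CLUSTERS` and `NEW_CLUSTER_DISTANCE` are hypothetical names and values.

```python
import math

MAX_CLUSTERS = 50            # hypothetical limit on the number of postures
NEW_CLUSTER_DISTANCE = 0.5   # hypothetical distance beyond which a new posture is created


def assign_posture_id(snapshot, postures,
                      threshold=NEW_CLUSTER_DISTANCE, max_clusters=MAX_CLUSTERS):
    """Assign a postureID to a snapshot, updating `postures` in place.

    snapshot: sequence of joint coordinates (floats).
    postures: dict mapping postureID -> representative coordinates.
    """
    best_id, best_dist = None, math.inf
    for posture_id, centroid in postures.items():
        dist = math.dist(snapshot, centroid)
        if dist < best_dist:
            best_id, best_dist = posture_id, dist
    can_grow = len(postures) < max_clusters
    if best_id is None or (best_dist > threshold and can_grow):
        new_id = len(postures) + 1      # insert a new posture
        postures[new_id] = tuple(snapshot)
        return new_id
    return best_id                      # match to the nearest existing posture
```

Once the limit is reached, a snapshot that is far from every known posture is still assigned to the nearest one rather than creating a new cluster.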
An AppComponents::iPredictionModule component 1540 may serve to take in data from a database, and turn the data into information to leverage. The AppComponents::iPredictionModule component 1540 may identify risky drivers, review their in-cabin driving habits, and eventually act to curb these risky habits. This section explains how the data may be modeled to better understand which factors correlate to a defined risk metric and how certain behavior patterns contribute to a higher insurance risk rating.
An AppComponents::iReportGenerator component 1520 may include input information taken from a database, the ten coordinates taken from the data stream during a demo, a start time, an elapsed time and some dummy information. Output/Service may be provided including a video of skeleton frames with start time and elapsed time, and a report that displays charts that may illustrate what happened during the demo. The report may include a picture of the driver, the driver's name, and the range of movement of the most distinct postures. The report may also have a line graph and a bar graph that show how much time the driver spent in each posture. The report may display the skeleton coordinates of the five postures the driver was in the most, along with the time and number of occurrences of each. Component/Entity process may include: a Generator; a Report; a Video; and DAOs for the following entities: Ride, Posture and Joint. Constraints may include that a demo has at least five different postures, and that the number of postures and the number of occurrences do not exceed a maximum array length.
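The "five postures the driver was in the most, along with the time and number of occurrences of each" can be computed with a short aggregation. The sketch below assumes the input is a time-ordered sequence of `(posture_id, seconds)` samples; that layout and the function name `summarize_postures` are hypothetical.

```python
from collections import defaultdict


def summarize_postures(samples, top_n=5):
    """Return [(posture_id, total_seconds, occurrences)] for the top_n postures.

    samples: time-ordered iterable of (posture_id, seconds) pairs.
    An occurrence is counted each time the driver enters a posture,
    so consecutive samples of the same posture count once.
    """
    total_time = defaultdict(float)
    occurrences = defaultdict(int)
    previous = None
    for posture_id, seconds in samples:
        total_time[posture_id] += seconds
        if posture_id != previous:
            occurrences[posture_id] += 1
        previous = posture_id
    ranked = sorted(total_time, key=lambda pid: total_time[pid], reverse=True)
    return [(pid, total_time[pid], occurrences[pid]) for pid in ranked[:top_n]]
```

The same totals could feed the line graph and bar graph of time spent in each posture.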
A car-sharing insurance product could more specifically insure the driver, regardless of the car. Traditional underwriting looks at the driver-vehicle combination. What car-sharing would allow you to do is to more heavily weight the risk of the driver alone. The methods and systems of the present disclosure may allow car-sharing to get that risk information on the driver and carry it forward to whatever car they use. This would be tailored for that particular driver's behavior, rather than demographic and vehicle-use factors. This would allow certain car-sharing entities to have a cost advantage. If they are paying less insurance—or more specific insurance—they could pass those savings to their customers and have a retention strategy.
The methods and systems of the present disclosure may assist emergency responders by, for example, using gesture recognition systems from an aftermarket/insurance device to provide first responders with an estimate of the severity of a crash and of what kinds of resources/equipment/expertise are required in order to extricate occupants. The same estimate may support triage by giving responders some idea of what the emergency medical needs could be upon arrival. Since the “golden hour” is so critical, and it is not always known how much of that hour has already expired, even a preliminary or broad clue could be helpful in the triage process. The aftermarket gesture recognition device is already operating at the time of the crash. It is collecting data about the driver's position/posture and the location of the arms relative to the body and structures in the vehicle (e.g., the steering wheel). Accelerometers in the device are able to recognize that a crash has occurred (if a pre-determined acceleration threshold has been reached). Upon crash detection, the device could transmit via the driver's phone (which is already connected via Bluetooth) or perhaps transmit using an onboard transmitter that uses emergency frequencies (and therefore does not require the consumer to pay data fees). The gesture recognition may come from any original equipment or aftermarket gesture tracking device, whether or not for insurance purposes.
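The accelerometer-based crash recognition described above amounts to a threshold test on acceleration magnitude. A minimal sketch follows; the threshold value, the sample layout and the name `detect_crash` are assumptions for illustration, since the disclosure only states that a pre-determined threshold is used.

```python
import math

CRASH_THRESHOLD_G = 4.0  # hypothetical pre-determined threshold, in g


def detect_crash(samples, threshold_g=CRASH_THRESHOLD_G):
    """Return the timestamp of the first sample at or above the threshold, else None.

    samples: iterable of (timestamp_s, ax, ay, az) with accelerations in g.
    """
    for timestamp, ax, ay, az in samples:
        magnitude = math.sqrt(ax * ax + ay * ay + az * az)
        if magnitude >= threshold_g:
            return timestamp
    return None
```

The returned timestamp could be paired with the most recent posture data and sent to responders, and it also gives a starting point for estimating how much of the “golden hour” has elapsed.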
The methods and systems of the present disclosure may support a transition from automated to manual driving mode in the case of vehicle automation systems operating the piloting functions with the human in a supervisory role. The vehicle may encounter a situation where it needs to transfer control to the driver, but the driver may or may not be ready to resume control. The methods and systems of the present disclosure may allow gesture recognition systems, or any gesture recognition system, to be used to determine if the driver is ready to resume control and, if he/she is not ready, to get his/her attention quickly. The gesture recognition would be used to ascertain whether the driver is ready to resume control by evaluating the driver's posture, the location of hands, the orientation of head, and body language. Machine learning may be used to evaluate driver engagement/attention/readiness-to-engage based on those variables. The gesture recognition could be provided by any original in-vehicle equipment or aftermarket device.
The methods and systems of the present disclosure may distinguish between automated and manual driving modalities for variable insurance rating, for a scenario where there are many vehicles that are capable of both automatically operating the piloting functions and having the driver manually operate the piloting functions. The driver can elect to switch between automated and manual driving modes at any point during a drive. Gesture recognition would be utilized to distinguish whether a driver is operating the vehicle manually, or whether the vehicle is operating automatically. This could be determined through either OEM or aftermarket hardware. The sensors and software algorithms are able to differentiate between automatic and manual driving based on hand movements, head movements, body posture, and eye movements. They can distinguish between the driver making hand contact with the steering wheel while acting as a supervisor (to show that he/she is supervising) and the driver providing steering input for piloting purposes. Who/what is operating the vehicle would determine what real-time insurance rates the customer is charged.
The methods and systems of the present disclosure may provide a tool for measuring driver distraction, where gesture recognition may be used to identify, distinguish and quantify driver distraction for safety evaluation of vehicle automation systems. This would be used to define metrics and evaluate safety risk for the vehicle human-machine interface as a whole, or for individual systems, in the case where vehicles have automation and vehicle-to-vehicle/vehicle-to-infrastructure communication capabilities. Here, vehicle automation means that the vehicle is capable of performing piloting functions without driver input, and vehicle-to-vehicle/vehicle-to-infrastructure communication means that the vehicle is capable of communicating data about the first vehicle dynamics or environmental traffic/weather conditions around the first vehicle. For any entity looking to evaluate the safety or risk presented by a vehicle with automated driving capabilities, DRIVES gesture recognition could be useful to quantify risk presented by driver distraction resulting from any vehicle system in the cabin (e.g., an entertainment system, a feature that automates one or more functions of piloting, a convenience system). With the rise of vehicle automation systems and capabilities, tools will be needed to evaluate the safety of individual systems in the car, or the car as a whole. Much uncertainty remains about how these systems will be used by drivers (especially those who are not from the community of automotive engineering or automotive safety). Determining whether they create a net benefit to drivers is a big question. The methods and systems of the present disclosure may allow gesture recognition to be used to identify the presence of distracted driving behaviors that are correlated with the presence of vehicle automation capabilities. The distraction could be quantified by the duration for which the driver engages in certain behaviors.
Risk quantification may also be measured by weighting certain behaviors with higher severity than other behaviors, so that the duration times are weighted. Risk quantification may also differentiate subcategories of behaviors based on degree of motion of hands, head, eyes, and body. For example, the methods and systems of the present disclosure may distinguish texting with the phone on the steering wheel from texting with the phone in the driver's lap, which requires frequent glances up and down. The latter would be quantified with greater risk in terms of severity of distraction. The purpose of this risk evaluation could include, but is not limited to, adherence to vehicle regulations, providing information to the general public, vehicle design testing, or insurance purposes.
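The severity-weighted duration measure described above reduces to a weighted sum over observed behavior episodes. The following sketch uses hypothetical subcategory names and weights (`SEVERITY_WEIGHT`) chosen only so that texting in the lap is weighted above texting on the steering wheel, as in the example.

```python
# Hypothetical severity weights per behavior subcategory (illustrative only).
SEVERITY_WEIGHT = {
    "texting_on_wheel": 2.0,
    "texting_in_lap": 3.0,   # frequent glances up and down: weighted higher
    "adjusting_radio": 1.0,
}


def weighted_distraction_score(episodes):
    """Sum of duration (seconds) times severity weight over all episodes.

    episodes: iterable of (behavior_subcategory, duration_seconds).
    """
    return sum(SEVERITY_WEIGHT[behavior] * duration
               for behavior, duration in episodes)
```

For instance, ten seconds of texting in the lap plus four seconds of adjusting the radio gives 3.0 × 10 + 1.0 × 4 = 34.0 under these assumed weights.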
This detailed description is to be construed as exemplary only and does not describe every possible embodiment, as describing every possible embodiment would be impractical, if not impossible. One may implement numerous alternate embodiments, using either current technology or technology developed after the filing date of this application.
Claims
1. A device, comprising:
- one or more processors and one or more memories storing instructions, that, when executed by the one or more processors, cause the device to:
- receive current image data captured by one or more vehicle interior sensors, wherein the current image data is representative of a pattern of current vehicle occupant head gestures;
- classify the current image data as being representative of a road hazard, based on the pattern in vehicle occupant head gestures, by comparing the current image data to previously classified image data representative of a pattern of previously classified vehicle occupant head gestures that are correlated with road hazards; and
- generate a real-time geographic map incorporating an indication of the road hazard within the geographic map.
2. The device as in claim 1, wherein the one or more vehicle interior sensors include one or more of: a digital image sensor, an ultra-sonic sensor, a radar-sensor, an infrared light sensor, or a laser light sensor.
3. The device as in claim 1, wherein the instructions, when executed by the one or more processors, further cause the device to:
- categorize previously-uncategorized behaviors based on comparing the current image data to the previously classified image data, wherein the currently classified image data is representative of the categorized previously-uncategorized behaviors.
4. The device as in claim 1, wherein the current image data is representative of a three-dimensional representation of at least one occupant within the vehicle interior.
5. The device as in claim 1, wherein the previously classified image data is representative of a three-dimensional representation of at least one occupant within the vehicle interior.
6. The device as in claim 1, wherein the current image data includes images and/or extracted image features that are representative of a vehicle occupant using a cellular telephone, a vehicle occupant looking out a vehicle side window, a vehicle occupant adjusting a vehicle radio, a vehicle occupant adjusting a vehicle heating, ventilation and air conditioning system, two vehicle occupants talking with one-another, a vehicle occupant reading a book or magazine, a vehicle occupant putting on makeup, a vehicle occupant looking at themselves in a mirror, a vehicle occupant eating, or a vehicle occupant drinking.
7. The device as in claim 1, wherein the previously classified image data includes images and/or extracted image features that have previously been classified as being representative of a vehicle occupant using a mobile device, a vehicle occupant looking out a vehicle side window, a vehicle occupant adjusting a vehicle radio, a vehicle occupant adjusting a vehicle heating, ventilation and air conditioning system, two vehicle occupants talking with one-another, a vehicle occupant reading a book or magazine, a vehicle occupant putting on makeup, a vehicle occupant looking at themselves in a mirror, a vehicle occupant eating, or a vehicle occupant drinking.
8. A computer-implemented method, comprising:
- receiving, by the one or more processors, current image data captured by one or more vehicle interior sensors, wherein the current image data is representative of at least one pattern of current vehicle occupant head gestures;
- classifying, by the one or more processors, at least one pattern of head gestures associated with a vehicle occupant as being representative of a road hazard, based on a comparison of the current image data with previously classified image data representative of a pattern of previously classified vehicle occupant head gestures that are correlated with road hazards; and
- generating, by the one or more processors, a real-time geographic map incorporating an indication of the road hazard within the geographic map.
9. The method as in claim 8, wherein the one or more vehicle interior sensors include one or more of: a digital image sensor, an ultra-sonic sensor, a radar-sensor, an infrared light sensor, or a laser light sensor.
10. The method as in claim 8, wherein the current image data is representative of a three-dimensional representation of at least one occupant within the vehicle interior.
11. The method as in claim 8, wherein at least one vehicle operator gesture is determined using a probability function.
12. The method as in claim 8, wherein the previously classified image data is representative of a three-dimensional representation of at least one occupant within the vehicle interior.
13. The method as in claim 8, wherein the current image data includes images and/or extracted image features that are representative of vehicle occupant locations/orientations, cellular telephone locations/orientations, vehicle occupant eye locations/orientations, vehicle occupant head location/orientation, vehicle occupant hand location/orientation, a vehicle occupant torso location/orientation, a seat belt location, or a vehicle seat location/orientation.
14. The method as in claim 8, wherein the previously classified image data includes images and/or extracted image features that have previously been classified as being representative of known vehicle occupant locations/orientations, known cellular telephone locations/orientations, known vehicle occupant eye locations/orientations, known vehicle occupant head location/orientation, known vehicle occupant hand location/orientation, a known vehicle occupant torso location/orientation, a known seat belt location, or a known vehicle seat location/orientation.
15. A non-transitory computer-readable medium storing computer-readable instructions that, when executed by a processor, cause the processor to:
- receive current image data captured by one or more vehicle interior sensors, wherein the current image data is representative of patterns of current vehicle occupant head gestures;
- classify the current image data as being representative of a road hazard by comparing the current image data to previously classified image data representative of a pattern of previously classified vehicle occupant head gestures that are correlated with road hazards; and
- generate a real-time geographic map incorporating an indication of the road hazard within the geographic map.
16. The non-transitory computer-readable medium as in claim 15, wherein a vehicle operator degree of risk is determined using a probability function.
17. The non-transitory computer-readable medium as in claim 15, wherein the current image data is representative of a three-dimensional representation of at least one occupant within the vehicle interior.
18. The non-transitory computer-readable medium as in claim 15, wherein the current image data includes images and/or extracted image features that are representative of a vehicle occupant using a cellular telephone, a vehicle occupant looking out a vehicle side window, a vehicle occupant adjusting a vehicle radio, a vehicle occupant adjusting a vehicle heating, ventilation and air conditioning system, two vehicle occupants talking with one-another, a vehicle occupant reading a book or magazine, a vehicle occupant putting on makeup, or a vehicle occupant looking at themselves in a mirror, vehicle occupant locations/orientations, cellular telephone locations/orientations, vehicle occupant eye locations/orientations, vehicle occupant head location/orientation, vehicle occupant hand location/orientation, a vehicle occupant torso location/orientation, a seat belt location, or a vehicle seat location/orientation.
19. The non-transitory computer-readable medium as in claim 15, wherein the previously classified image data includes images and/or extracted image features that have previously been classified as being representative of a vehicle occupant using a cellular telephone, a vehicle occupant looking out a vehicle side window, a vehicle occupant adjusting a vehicle radio, a vehicle occupant adjusting a vehicle heating, ventilation and air conditioning system, two vehicle occupants talking with one-another, a vehicle occupant reading a book or magazine, a vehicle occupant putting on makeup, a vehicle occupant looking at themselves in a mirror, known vehicle occupant locations/orientations, known cellular telephone locations/orientations, known vehicle occupant eye locations/orientations, known vehicle occupant head location/orientation, known vehicle occupant hand location/orientation, a known vehicle occupant torso location/orientation, a known seat belt location, or a known vehicle seat location/orientation.
20. The non-transitory computer-readable medium as in claim 15, wherein the previously classified image data is representative of a three-dimensional representation of at least one occupant within the vehicle interior.
Type: Application
Filed: Jan 19, 2022
Publication Date: May 12, 2022
Inventors: Aaron Scott Chan (San Jose, CA), Kenneth J. Sanchez (San Francisco, CA)
Application Number: 17/579,371