This repository presents the results of a research project carried out in scientific collaboration with Arrival. The overall objective of the research is to investigate whether Reinforcement Learning (RL) can be used to solve various problems related to self-driving cars. The car model, provided by Arrival, describes the dynamics of a real car in detail and, in particular, includes tire models that describe the forces in the road-tire interaction according to the dynamic condition of the vehicle. The repository contains only the results of the experiments; it does not include the model itself.
Implementation: Vitaly Kalev, Aleksandr Goranov, Anton Plaksin
Advisors: Vladimir Bulaev
In general, the car dynamics can be divided into two types of motion: longitudinal motion, where the main task is to maintain the required speed, and lateral motion, where the main task is to maintain the desired direction.
The external longitudinal forces acting on the vehicle include aerodynamic drag forces, gravitational forces, longitudinal tire forces and rolling resistance forces.
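A force balance along the vehicle longitudinal axis yields

$$m\ddot{x} = F_{xf} + F_{xr} - F_{aero} - R_{xf} - R_{xr} - mg\sin\theta,$$

where $F_{xf}$ and $F_{xr}$ are the longitudinal tire forces at the front and rear tires, $R_{xf}$ and $R_{xr}$ are the rolling resistance forces at the front and rear tires, $F_{aero}$ is the equivalent longitudinal aerodynamic drag force, $m$ is the mass of the vehicle, $g$ is the acceleration due to gravity, and $\theta$ is the angle of inclination of the road.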
Experimental results have established that the longitudinal tire force generated by each tire depends on the normal (vertical) force on the tire, the slip ratio
The vertical force on each tire comes from the portion of the vehicle weight it carries, which is influenced by the fore-aft location of the center of gravity (c.g.), the vehicle longitudinal acceleration, the aerodynamic drag forces and the grade of the road.
Let the longitudinal velocity at the tire under consideration be
Coefficient of friction
Such a model provides a mathematical description of the vehicle motion without considering the forces that affect the motion. The equations of motion are based purely on geometric relationships governing the system.
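As an illustration of such a purely geometric model, here is a minimal sketch of a kinematic bicycle model with the rear axle as the reference point. It is not the Arrival model: the names `KinematicState` and `kinematic_step`, the default wheelbase and the time step are assumptions made for this example only.

```python
import math
from dataclasses import dataclass


@dataclass
class KinematicState:
    x: float    # global X position of the rear axle, m
    y: float    # global Y position of the rear axle, m
    yaw: float  # heading angle, rad
    v: float    # speed along the heading, m/s


def kinematic_step(state, steer, accel, wheelbase=2.8, dt=0.01):
    """Advance the model by one explicit Euler step.

    steer is the front-wheel steering angle in rad, accel the longitudinal
    acceleration in m/s^2. The wheelbase and dt defaults are illustrative.
    No tire or road forces appear anywhere: the motion follows from geometry.
    """
    x = state.x + state.v * math.cos(state.yaw) * dt
    y = state.y + state.v * math.sin(state.yaw) * dt
    yaw = state.yaw + state.v / wheelbase * math.tan(steer) * dt
    v = state.v + accel * dt
    return KinematicState(x, y, yaw, v)
```

With a constant steering angle the model turns on a circle of radius `wheelbase / tan(steer)` regardless of speed or road surface, which is exactly the limitation that motivates the force-based models.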
The forces and moments from the road act on each tire of the vehicle and strongly influence its dynamics. The lower curve in the figure below shows the trajectory that the vehicle would follow in response to a steering input from the driver if the road were dry and had a high tire-road friction coefficient. In this case the high friction coefficient provides the lateral force the vehicle needs to negotiate the curved road. If the coefficient of friction were small, or if the vehicle speed were too high, the vehicle would be unable to follow the nominal motion required by the driver; it would instead travel on a trajectory of larger radius (smaller curvature), as shown in the upper curve of the figure. If the friction coefficient is very small, it might not be possible to fully achieve the nominal yaw rate that the driver would achieve on a high-friction road surface.
When the longitudinal slip ratio is not small, or when the road is slippery, a nonlinear tire model such as the Pacejka "Magic Formula" model must be used to calculate the longitudinal tire force. For this reason, most car control algorithms rely on information about the tire-road friction coefficient and are tuned to work on dry, wet or icy surfaces. Knowing the value of this coefficient therefore makes it possible to control the car more effectively.
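To show how the friction coefficient enters a nonlinear tire model, here is a minimal sketch of the simplified Magic Formula, F = D sin(C arctan(Bk - E(Bk - arctan(Bk)))), where k is the slip ratio. The function name `magic_formula` and the coefficient values B, C and E are illustrative assumptions, not the parameters of the Arrival model. Taking the peak factor as D = mu * F_z is also a simplification: in a full Pacejka model the other coefficients depend on load and surface as well.

```python
import math


def magic_formula(slip, normal_load, mu=1.0, B=10.0, C=1.9, E=0.97):
    """Longitudinal tire force (N) from the simplified Magic Formula.

    slip        -- longitudinal slip ratio (dimensionless)
    normal_load -- vertical force on the tire, N
    mu          -- tire-road friction coefficient (scales the peak force)
    B, C, E     -- stiffness, shape and curvature factors (illustrative values)
    """
    D = mu * normal_load  # peak factor: assumed proportional to mu and load
    Bk = B * slip
    return D * math.sin(C * math.atan(Bk - E * (Bk - math.atan(Bk))))


# Same slip ratio and load, different surfaces (hypothetical mu values)
for surface, mu in [("dry", 1.0), ("wet", 0.6), ("icy", 0.2)]:
    print(surface, round(magic_formula(0.1, 4000.0, mu=mu), 1))
```

At small slip the curve is nearly linear, then it saturates near D, so a controller that assumes the wrong friction coefficient will misjudge how much force the tire can actually deliver.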
The problem can be formalized as a one-step Markov Decision Process (MDP)
is the reward function, where
To solve the problem, we use the cross-entropy method (CEM).
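For readers unfamiliar with CEM, here is a minimal sketch of a Gaussian cross-entropy method for a one-step problem: sample actions, evaluate their rewards, keep the best ones, and refit the sampling distribution to them. It is not the repository's implementation. The function name `cem_maximize`, the hyperparameters and the toy reward are assumptions; in the experiments the reward would come from a run of the car model.

```python
import random
import statistics


def cem_maximize(reward_fn, mean, std, n_samples=64, n_elite=8, n_iters=30, seed=0):
    """Return the mean of a diagonal Gaussian fitted to high-reward actions.

    reward_fn -- maps an action (list of floats) to a scalar reward
    mean, std -- initial per-dimension mean and standard deviation
    """
    rng = random.Random(seed)
    mean, std = list(mean), list(std)
    for _ in range(n_iters):
        # Sample candidate actions from the current distribution
        actions = [[rng.gauss(m, s) for m, s in zip(mean, std)]
                   for _ in range(n_samples)]
        # Keep the n_elite actions with the highest reward
        elite = sorted(actions, key=reward_fn, reverse=True)[:n_elite]
        # Refit the distribution to the elite set, one dimension at a time
        columns = list(zip(*elite))
        mean = [statistics.fmean(c) for c in columns]
        std = [statistics.pstdev(c) + 1e-6 for c in columns]  # avoid zero std
    return mean


def toy_reward(action):
    """Hypothetical reward with its maximum at (0.7, -0.2)."""
    return -(action[0] - 0.7) ** 2 - (action[1] + 0.2) ** 2


best = cem_maximize(toy_reward, mean=[0.0, 0.0], std=[1.0, 1.0])
```

Because the MDP has a single step, no value function or policy network is needed: the method only requires the ability to evaluate the reward of a candidate action.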
The obtained results show that for the true value of
Since a real car and its model can differ, it is important to study how stable the algorithm's performance is with respect to the car parameters (such as length, mass, tire parameters, etc.). Within our experiments, we fix
The blue line shows the true
The results show the algorithm gets values quite close to the true
Let us fix a car model,
i.e. we stop the motion when the car has passed the segment.
We again use the CEM algorithm. Its results are presented below.
The figure shows that the algorithm steadily converges to 8.6 s. To check the optimality of this value, consider the following figure:
This figure shows the dependence of force on
This problem is similar to the previous one, but in this case the car must go through the turn as fast as possible. Moreover, the car must not drive inside a circle of a certain radius. To ensure this, we put the reward
The fastest time is 4.3 s for a half circle, 6.2 s for a 3/4 segment, and 7.1 s for a full circle.
- Rajesh Rajamani, "Vehicle Dynamics and Control", Mechanical Engineering Series, 2012.