Car Racing¶

This environment is part of the Box2D environments. Please read that page first for general information.
Action Space |
Box([-1. 0. 0.], 1.0, (3,), float32) |
Observation Shape |
(96, 96, 3) |
Observation High |
255 |
Observation Low |
0 |
Import |
|
Description¶
The easiest control task to learn from pixels - a top-down racing environment. The generated track is random every episode.
Some indicators are shown at the bottom of the window along with the state RGB buffer. From left to right: true speed, four ABS sensors, steering wheel position, and gyroscope. To play yourself (it’s rather fast for humans), type:
python gym/envs/box2d/car_racing.py
Remember: it’s a powerful rear-wheel drive car - don’t press the accelerator and turn at the same time.
Action Space¶
If continuous: There are 3 actions: steering (-1 is full left, +1 is full right), gas, and breaking. If discrete: There are 5 actions: do nothing, steer left, steer right, gas, brake.
Observation Space¶
State consists of 96x96 pixels.
Rewards¶
The reward is -0.1 every frame and +1000/N for every track tile visited, where N is the total number of tiles visited in the track. For example, if you have finished in 732 frames, your reward is 1000 - 0.1*732 = 926.8 points.
Starting State¶
The car starts at rest in the center of the road.
Episode Termination¶
The episode finishes when all of the tiles are visited. The car can also go outside of the playfield - that is, far off the track, in which case it will receive -100 reward and die.
Arguments¶
lap_complete_percent
dictates the percentage of tiles that must be visited by
the agent before a lap is considered complete.
Passing domain_randomize=True
enables the domain randomized variant of the environment.
In this scenario, the background and track colours are different on every reset.
Passing continuous=False
converts the environment to use discrete action space.
The discrete action space has 5 actions: [do nothing, left, right, gas, brake].
Reset Arguments¶
Passing the option options["randomize"] = True
will change the current colour of the environment on demand.
Correspondingly, passing the option options["randomize"] = False
will not change the current colour of the environment.
domain_randomize
must be True
on init for this argument to work.
Example usage:
env = gym.make("CarRacing-v1", domain_randomize=True)
# normal reset, this changes the colour scheme by default
env.reset()
# reset with colour scheme change
env.reset(options={"randomize": True})
# reset with no colour scheme change
env.reset(options={"randomize": False})
Version History¶
v1: Change track completion logic and add domain randomization (0.24.0)
v0: Original version
References¶
Chris Campbell (2014), http://www.iforce2d.net/b2dtut/top-down-car.
Credits¶
Created by Oleg Klimov