Comparison between clustering-based bonus rewards with novelty alone (η = 1.0) and clustering-based bonus rewards (η = 0.5). Here, the collected states (blue dots) are clustered into 5 clusters and ...
What if our brains learned from rewards not just by averaging them but by considering their full range of possibilities? A ...