AI Learns to steal
278,391
Published 2023-12-19
Key Hyperparameters for those interested:
type: PPO
Batch Size: 512
Buffer Size: 4096
Hidden Units: 512
Hidden Layers: 4 (yep, 4 actually got me the best learning at a decent rate; typically I'd try 2 or 3 layers first)
total training time:
approx 8.4 hours
20,800,000 steps
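For anyone who wants to sanity-check those numbers, here is a rough sketch of the arithmetic they imply. It assumes that "steps" means environment steps and that the policy is updated once each time the buffer fills, which is how PPO is commonly set up but is not confirmed for this run. The function name and dictionary keys are made up for illustration.

```python
# Back-of-the-envelope figures derived from the hyperparameters listed above.
# Assumptions (not confirmed by the video): "steps" are environment steps, and
# one policy-update phase happens each time the experience buffer fills.

BATCH_SIZE = 512
BUFFER_SIZE = 4096
TOTAL_STEPS = 20_800_000
TRAINING_HOURS = 8.4  # approximate, as stated


def training_summary(batch_size, buffer_size, total_steps, hours):
    """Return derived quantities for a buffer-then-update PPO-style run."""
    return {
        # How many minibatches one full buffer splits into (per epoch).
        "minibatches_per_buffer": buffer_size // batch_size,
        # How many times the buffer filled over the whole run.
        "buffer_fills": total_steps / buffer_size,
        # Average simulation throughput over the run.
        "steps_per_second": total_steps / (hours * 3600),
    }


summary = training_summary(BATCH_SIZE, BUFFER_SIZE, TOTAL_STEPS, TRAINING_HOURS)

if __name__ == "__main__":
    for name, value in summary.items():
        print(f"{name}: {value:,.3f}")
```

Under those assumptions, each full buffer splits into 8 minibatches, the buffer filled a little over 5,000 times, and the run averaged somewhere under 700 steps per second. The number of epochs per buffer is not listed, so the total count of gradient updates can't be derived from this alone.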
music:
1 - 368 - Dyalla
2 - All I Am - Dyalla
3 - Beyond - Patrick Patrikios
4 - TORSION - Density & Time
5 - AETHER - Density & Time
6 - Organic Guitar House - Dyalla
textures from:
PBR Material Bundle Vol1 - Cole
100+ PBR Materials Pack - Integrity Software & Games
AllSky - 220+ Sky / Skybox Set - RPGWHITELOCK
and some bad ones from me!
models from:
Low Poly Ultimate Pack - POLYPERFECT
Low Poly Disco Bar Pack - GOLS Consulting
All Comments (21)
-
Imagine if a dude broke into your house, did sick flips for 3 minutes then left without saying a word.
-
Sad to see Dingus fall on hard times.
-
Dingus is not a thief, he is a GREAT ASSET to the company
-
I imagine Dingus was given no memory which explains why Dingus immediately moves on when objects fall out of view. With that in mind, going up the staircase and jumping off is actually a brilliant strategy because that path has more visual coverage than anywhere else in the room.
-
His addiction to boulder rolling has led Dingus to a life of crime.
-
I'd love to see him try and steal in even more environments, with uneven or unnatural terrain, or more hazards.
-
If I ever see 3 stairs, I will now purposefully avoid the middle one at all times, even if it's an easier path.
-
He's struggling to resist the voices’ demands to climb up the stairs and flip off the balcony.
-
I wonder if Dingus would be able to easily generalize to new maps, or if he's just learned how to beat these particular maps. Future video idea?
-
If he struggles with this, imagine it with guards.
-
I have a feeling the stairway loop has something to do with the reward system in some way. Maybe pushing a valuable down the stairs generates a high reward, as opposed to just nudging it like in the beginning, since it travels much closer to the car, thus leading Dingus to associate the stairway with that reward spike.
-
9:13 Reminds me of video game speedrun brute forcers that find crazy tricks like this by testing every possible permutation of inputs. Very cool discovery.
-
I love the part where Dingus starts chugging wine and shoves the speaker as far as he can under the table. I can picture an actual incompetent thief getting wine drunk and violently proceeding to force the electronics under furniture.
-
I want to see one with two competing AIs training concurrently, like a cop AI vs a robber AI
-
All future Dinguses should be granted small rewards for doing sick flips from staircases, because those really were some sick flips.
-
I love how Dingus was literally just any player of a game goofing around at the start.
-
So sad seeing dingus have to resort to this 😔
-
A good representation of how I learn how to play a new game. Go dingus...
-
Dingus may be an idiot but he's our idiot. Never change, Dingus. Never change.
-
Honestly, I’m surprised it went so smoothly. My guess is that you left out a lot of the trial and error that I know goes into this; I’ve heard of people running simulations for literal days and coming out with an AI that was only slightly better than before.