AI Learns to steal

278,391

10,511 0

Published 2023-12-19

Based on a true story!

Key Hyperparameters for those interested:
type: PPO
Batch Size: 512
Buffer Size: 4096
Hidden Units: 512
Hidden Layers: 4 (yep, 4 actually got me the best learning at a decent rate, typically I'd try with 2 or three layers first)

total training time:
approx 8.4 hours
20,800,000 steps

music:
1 - 368 - Dyalla
2 - All I Am - Dyalla
3 - Beyond - Patrick Patrikios
4 - TORSION - Density & Time
5 - AETHER - Density & Time
6 - Organic Guitar House - Dyalla

textures from:
PBR Material Bundle Vol1 - Cole
100+ PBR Materials Pack - Integrity Software & Games
AllSky - 220+ Sky / Skybox Set - RPGWHITELOCK
and some bad ones from me!

models from:
Low Poly Ultimate Pack - POLYPERFECT
Low Poly Disco Bar Pack - GOLS Consulting

All Comments (21)

@kalandillon8320 4 months ago

Imagine if a dude broke into your house, did sick flips for 3 minutes then left without saying a word.
@JamesJamesW 4 months ago

Sad to see Dingus fall on hard times.
@Lozoot2 4 months ago

Dingus is not a thief, he is a GREAT ASSET to the company
@bitblit 4 months ago

I imagine Dingus was given no memory which explains why Dingus immediately moves on when objects fall out of view. With that in mind, going up the staircase and jumping off is actually a brilliant strategy because that path has more visual coverage than anywhere else in the room.
@morgan3392 4 months ago

His addiction to boulder rolling has led Dingus to a life of crime.
@Nevernamed 4 months ago

I'd love to see him try and steal in even more environments, with uneven or unnatural terrain, or more hazards.
@SkillfulLupus 4 months ago

if i ever see 3 stairs i will now purposefully avoid the middle one at all times even if it's an easier path
@Memer9456 1 month ago

hes struggling to resist the voices’ demands to climb up the stairs and flip off the balcony
@McTuber42 4 months ago

I wonder if Dingus would be able to easily generalize to new maps, or if he's just learned how to beat these particular maps. Future video idea?
@icebreaker690 4 months ago

If he struggles with this imagine this with guards
@vrchhu4066 4 months ago

I have a feeling the stairway loop has something to do with the reward system in some way. Maybe pushing down a valuable generates a high reward value as opposed to just nudging it like in the beginning, since it travels much closer to the car, thus leading Dingus to associate the stairway with that rewards spike.
@punishedkid 4 months ago

9:13 Reminds me of video game speedrun brute forcers that find crazy tricks like this by testing every possible permutation of inputs. Very cool discovery.
@Fighting.Flower 1 month ago

I love the part where Dingus starts chugging wine and shoves the speaker as far as he can under the table. I can picture an actual incompetent thief getting wine drunk and violently proceed to force the electronics under furniture.
@zacharymacleod1146 4 months ago

I want to see one with two competing AIs training concurrently, like a cop AI vs a robber AI
@neoncreeper1287 4 months ago

all future dinguses should be granted small rewards for doing sick flips from staircases because those really were some sick flips
@SusDoctor 4 months ago

I love how dingus was literally just any player of a game goofy around at the start
@SunshineBurrito 4 months ago

So sad seeing dingus have to resort to this 😔
@LemonPlayz101 4 months ago

A good representation of how I learn how to play a new game. Go dingus...
@ethanwilde4716 4 months ago

Dingus may be a idiot but he's our idiot. Never change dingus. Never change.
@nathanreeder9716 4 months ago

Honestly, I’m surprised it went so smoothly. My guess is that you left out a lot of the trial and error that I know goes into this-I’ve heard of people running simulations for literal days and coming out with an ai that was only slightly better than before.