
I'm trying to solve a Rubik's Cube using deep learning and came across DQN, so I decided to give it a try. I wrote all the code and started training, but I got these results: [image]

The loss goes up and the test results never improve. I have tried changing the learning rate and the epsilon-greedy decay, and reducing the scramble to a single move, but the agent still can't solve the cube even when it is only one move from solved. So I would like to know whether it is simply impossible to do it this way, or whether I might be miscalculating some values.

Jonathan

1 Answer


I think so, yes: the Rubik's Cube group can be represented as a grid-world environment.

See this link from GitHub, where they attempted to solve it using DQN.
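On the "miscalculating some values" worry from the question, one thing that may be worth checking is how the training targets are built. The sketch below is not taken from the asker's code or from the linked repository; it is a minimal, plain-Python illustration of the standard one-step DQN target, r + gamma * max Q_target(s', a'), with the bootstrap term dropped on terminal transitions (for example, when the cube is solved). The function name `dqn_targets`, the argument names, and the sample numbers are all hypothetical.

```python
def dqn_targets(rewards, next_q_values, dones, gamma=0.99):
    """Compute one-step TD targets for a batch of transitions.

    rewards:       list of floats, reward received for each transition
    next_q_values: list of lists, target-network Q-values for each next state
    dones:         list of bools, True if the transition ended the episode
    gamma:         discount factor
    """
    targets = []
    for reward, q_next, done in zip(rewards, next_q_values, dones):
        # Terminal transitions must not bootstrap from the next state.
        bootstrap = 0.0 if done else gamma * max(q_next)
        targets.append(reward + bootstrap)
    return targets


# Hypothetical batch of two transitions (illustrative values only).
batch_targets = dqn_targets(
    rewards=[1.0, -0.1],
    next_q_values=[[5.0, 3.0], [0.5, 2.0]],
    dones=[True, False],
    gamma=0.9,
)
print(batch_targets)
```

In this example the first transition is terminal, so its target is just the reward and the large next-state Q-values are ignored; the second target adds the discounted maximum next-state value. If a one-move scramble never gets solved, comparing the targets your code produces against a hand calculation like this is a cheap sanity check.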

Stephen Rauch
DCnoob