Questions tagged [keras-rl]
keras-rl: a framework for reinforcement learning with Keras.
11 questions
7
votes
1 answer
What are the effects of clipping the reward on stability?
I am looking to stabilize my DQN results. I found that clipping is one technique to do it, but I did not understand it completely!
1- What are the effects of clipping the reward, clipping the gradient, and clipping the error on stability, and how does…
user10296606
- 1,784
- 5
- 17
- 31
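For context, the three kinds of clipping named in this question are usually applied at different points of the training loop. A minimal sketch, assuming TensorFlow's Keras API (the threshold values are illustrative, not from the question):

    import numpy as np
    import tensorflow as tf

    # 1) Reward clipping: bound the reward before storing the transition.
    raw_reward = 3.7                        # illustrative value
    clipped_reward = np.clip(raw_reward, -1.0, 1.0)

    # 2) Gradient clipping: bound gradient norms inside the optimizer.
    optimizer = tf.keras.optimizers.Adam(learning_rate=1e-3, clipnorm=1.0)

    # 3) Error clipping: the Huber loss grows linearly instead of
    #    quadratically beyond delta, so large TD errors don't explode.
    loss = tf.keras.losses.Huber(delta=1.0)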
4
votes
1 answer
How to implement clipping the reward in DQN in Keras
How do I implement clipping the reward in DQN in Keras? In particular, how do I implement the reward clipping itself?
Is this pseudocode correct:

    if reward < -threshold:
        reward = -1
    elif reward > threshold:
        reward = 1
    elif -threshold…
user10296606
- 1,784
- 5
- 17
- 31
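In keras-rl the idiomatic hook for this is a Processor. A minimal sketch, assuming the goal is simply to clip rewards into [-1, 1] (the threshold-scaling variant from the question would slot into the same method):

    import numpy as np
    from rl.core import Processor

    class ClippedRewardProcessor(Processor):
        """Clip every reward into [-1, 1] before the agent sees it."""
        def process_reward(self, reward):
            return np.clip(reward, -1.0, 1.0)

    # Hooked into the agent like any keras-rl processor, e.g.:
    # dqn = DQNAgent(..., processor=ClippedRewardProcessor())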
4
votes
1 answer
What is a minimal setup to solve the CartPole-v0 with DQN?
I solved CartPole-v0 with a CEM agent pretty easily (experiments and code), but I am struggling to find a setup that works with DQN.
Do you know which parameters should be adjusted so that the mean reward is about 200 for this problem?
What I…
Martin Thoma
- 18,630
- 31
- 92
- 167
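For reference, the keras-rl repository ships a CartPole DQN example; a condensed sketch in that style looks like the code below (the hyperparameters are illustrative and not a guarantee of a mean reward of 200):

    import gym
    from keras.models import Sequential
    from keras.layers import Dense, Flatten
    from keras.optimizers import Adam
    from rl.agents.dqn import DQNAgent
    from rl.policy import BoltzmannQPolicy
    from rl.memory import SequentialMemory

    env = gym.make('CartPole-v0')
    nb_actions = env.action_space.n

    # Small MLP mapping the (window, state) observation to Q-values.
    model = Sequential([
        Flatten(input_shape=(1,) + env.observation_space.shape),
        Dense(16, activation='relu'),
        Dense(16, activation='relu'),
        Dense(nb_actions, activation='linear'),
    ])

    dqn = DQNAgent(model=model, nb_actions=nb_actions,
                   memory=SequentialMemory(limit=50000, window_length=1),
                   nb_steps_warmup=100, target_model_update=1e-2,
                   policy=BoltzmannQPolicy())
    dqn.compile(Adam(lr=1e-3), metrics=['mae'])
    dqn.fit(env, nb_steps=50000, visualize=False, verbose=1)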
3
votes
1 answer
Evaluating a trained Reinforcement Learning Agent?
I am new to training reinforcement learning agents. I have read about the PPO algorithm and used the stable-baselines library to train an agent with PPO. My question is: how do I evaluate a trained RL agent? Consider, for a regression or…
chink
- 555
- 9
- 17
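For readers with the same question: stable-baselines ships an evaluation helper that rolls out the trained policy for several episodes and aggregates the rewards. A minimal sketch (PPO2 and CartPole-v1 are stand-ins, since the question does not name the exact model or environment):

    import gym
    from stable_baselines import PPO2
    from stable_baselines.common.evaluation import evaluate_policy

    env = gym.make('CartPole-v1')                # placeholder environment
    model = PPO2('MlpPolicy', env).learn(10000)

    # Run the trained policy deterministically over n episodes and
    # report mean/std of the episode rewards.
    mean_reward, std_reward = evaluate_policy(model, env, n_eval_episodes=10)
    print(f'mean reward: {mean_reward:.1f} +/- {std_reward:.1f}')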
2
votes
0 answers
Actions taken by agent / agent performance not improving
Hi, I am trying to develop an RL agent using the PPO algorithm. My agent takes an action (CFM) to keep a state variable called RAT between 24 and 24.5. I am using the PPO algorithm from the stable-baselines library to train my agent. I have trained the agent…
chink
- 555
- 9
- 17
2
votes
2 answers
Keras models break when I add batch normalization
I'm creating the model for a DDPG agent (keras-rl version), but I'm having some trouble with errors whenever I try adding batch normalization to the first of the two networks.
Here is the creation function as I'd like it to be:
def…
axon
- 23
- 4
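For comparison, a standalone Keras actor with batch normalization that builds without errors can look like the sketch below (layer sizes are illustrative; whether it then trains correctly inside keras-rl's DDPGAgent is a separate question):

    from keras.models import Sequential
    from keras.layers import Dense, BatchNormalization, Flatten

    def build_actor(observation_shape, nb_actions):
        """Hypothetical actor: Dense -> BatchNorm blocks, tanh output."""
        model = Sequential()
        model.add(Flatten(input_shape=(1,) + observation_shape))
        model.add(Dense(64, activation='relu'))
        model.add(BatchNormalization())
        model.add(Dense(64, activation='relu'))
        model.add(BatchNormalization())
        model.add(Dense(nb_actions, activation='tanh'))
        return model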
1
vote
0 answers
Is "nb_steps_warmup" set for each episode or globally?
When I configure a DQN agent, nb_steps_warmup can be set. Is this parameter set for each episode or once globally?
What I am trying to ask is: imagine I have a game environment that takes about 3000 steps max per episode. The DQN is fitted as…
StefanOverFlow
- 111
- 1
1
vote
1 answer
Formulation of a reward structure
I am new to reinforcement learning and am experimenting with training RL agents.
I have a doubt about reward formulation: from a given state, if an agent takes a good action I give a positive reward, and if the action is bad, I give a negative reward.…
chink
- 555
- 9
- 17
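As one concrete illustration of that pattern (hypothetical numbers, echoing the RAT band from the asker's other question): a shaped reward that is positive inside the target band and increasingly negative with the size of the violation outside it:

    def reward_fn(rat):
        """Hypothetical shaped reward: +1 inside the 24-24.5 band,
        negative and proportional to the violation outside it."""
        low, high = 24.0, 24.5
        if low <= rat <= high:
            return 1.0
        # Distance to the nearest band edge, used as a penalty.
        violation = (low - rat) if rat < low else (rat - high)
        return -violation

    print(reward_fn(24.2))   # 1.0
    print(reward_fn(25.0))   # -0.5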
1
vote
0 answers
with tf.device(DEVICE): model = modellib.MaskRCNN(mode="inference", model_dir=LOGS_DIR, config=config)
    ValueError  Traceback (most recent call last)
    /miniconda/lib/python3.6/site-packages/tensorflow/python/framework/op_def_library.py in _apply_op_helper(self, op_type_name, name, **keywords)
        509    as_ref=input_arg.is_ref,
    --> 510…
shiva
- 11
- 1
0
votes
0 answers
Using reinforcement learning for binary classification
I want to build an agent for binary classification. I have a large dataset with two labels (0 and 1), and I want to build an agent to predict the labels. I have built a deep model, and now I want to build the agent. I use keras-rl2, but there is a problem: for dqn…
sdbvuf sbjdsfdib
- 1
- 1
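One common framing (an assumption about the setup, not stated in the question) is to wrap the dataset as a custom gym environment with one-step episodes, where the action is the predicted label and the reward is +1 or -1:

    import gym
    import numpy as np
    from gym import spaces

    class ClassifyEnv(gym.Env):
        """One-step episodes: observe one sample, predict its label."""
        def __init__(self, X, y):           # X: 2-D features, y: 0/1 labels
            self.X, self.y = X, y
            self.action_space = spaces.Discrete(2)      # the predicted label
            self.observation_space = spaces.Box(
                low=-np.inf, high=np.inf, shape=(X.shape[1],), dtype=np.float32)
            self.i = 0

        def reset(self):
            self.i = np.random.randint(len(self.X))
            return self.X[self.i].astype(np.float32)

        def step(self, action):
            # +1 for a correct prediction, -1 for a wrong one; episode ends.
            reward = 1.0 if action == self.y[self.i] else -1.0
            return self.X[self.i].astype(np.float32), reward, True, {}

An instance of this env can then be passed to a keras-rl2 DQNAgent's fit() like any gym environment.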
0
votes
1 answer
Q-Learning experience replay: how to feed the neural network?
I'm trying to replicate the DQN Atari experiment. My DQN isn't performing well; while checking other people's code, I saw something about experience replay which I don't understand. First, when you define your CNN, in the first layer you have to…
Joaquin
- 1
- 3
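For readers landing here: the heart of experience replay is sampling stored transitions into a minibatch and feeding whole state batches through the network, not single frames. A minimal numpy sketch (all names are illustrative; q_net is any Keras model mapping a state batch to per-action Q-values):

    import random
    import numpy as np
    from collections import deque

    buffer = deque(maxlen=100000)          # stores (s, a, r, s_next, done)

    def replay_step(q_net, batch_size=32, gamma=0.99):
        # Assumes the buffer already holds at least batch_size transitions.
        batch = random.sample(buffer, batch_size)
        s, a, r, s_next, done = map(np.array, zip(*batch))

        # Bellman targets: only the taken action's Q-value is updated.
        q_next = q_net.predict(s_next).max(axis=1)
        targets = q_net.predict(s)
        targets[np.arange(batch_size), a] = r + gamma * q_next * (1 - done)

        q_net.train_on_batch(s, targets)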