Garth Wales
  • Introduction
  • META
  • To be added elsewhere
  • University
    • 400 Project Research
    • Aims
    • Presentation
    • Report
    • Work Diary
  • Deep Declarative Networks
    • Potential avenues
    • Work Diary
    • Deep Declarative Networks
    • Image Segmentation
  • shortcuts / syntax / setup
    • Jupyter
    • SSH
    • Python
    • CUDA
    • Screen
    • Markdown Syntax
    • Git
    • Dendron
    • Visual Studio Code
    • LaTeX
    • Windows
    • Bash
    • Vim
  • Machine Learning
    • Markov decision process
    • Reinforcement learning
    • Deep Q Network
    • Modern hopfield network
    • Object Detection
    • Face detection
    • Activation Functions
    • Gym
    • RL frameworks
    • Hyperparameters
    • Interpretability
  • Reinforcement learning
    • Memory
    • Exploration techniques
    • Agents
  • garth.wales
    • Website redesign
  • Computer Graphics and Vision
    • Image Mosaicing
    • Stereo Vision
    • Rendering
    • Exam papers
  • Neural Networks
    • Introduction
    • Learning
    • Learning theory
    • Stochastic Neural Networks
    • Interpretability
    • Word Respresentation
    • Language modelling with recurrent networks
    • Sequence-to-sequence networks
    • Exam
Powered by GitBook
On this page

Was this helpful?

  1. Machine Learning

Gym

OpenAI's way to create a reinforcement learning environment, used by most RL implementations.

PreviousActivation FunctionsNextRL frameworks

Last updated 4 years ago

Was this helpful?

Gym seeks to create standard environments, enabling better benchmarks to drive research in reinfrocement learning.

An environment contains:

  • action_space: space object corresponding to valid actions

  • observation_space: space object corresponding to valid observations

  • reward_range: corresponding min and max possible rewards

A step in an environment contains:

  • observation (object): pixel data, angles, velocities or board states

  • reward (float): amount of reward achieved by previous action

  • done (boolean): whether to reset the environment, termination action.

  • info (dict): diagnositic information (i.e. raw probabilities for last state)

The framework of reinforcement learning

How to create custom environment:

Keras-RL uses gym style environments (although the included ones are )

https://gym.openai.com/docs/#environments
blank abstract classes