Skip to content
View taodav's full-sized avatar

Block or report taodav

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. pobax pobax Public

    Partially Observable Benchmarks in JAX

    Python 21 4

  2. brownirl/lambda_discrepancy brownirl/lambda_discrepancy Public

    Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

    Python 21

  3. microsoft/TextWorld microsoft/TextWorld Public

    ​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

    Jupyter Notebook 1.4k 191

  4. nsrs nsrs Public

    Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.

    Jupyter Notebook 14 3