RL in Super Mario Bros

 Status : Completed

Tags: Python OpenAI Gym Nes-py Stable-baselines3 Gym-super-mario-bros Proximal Policy Optimization(PPO)



AIM

Develop an autonomous AI model using machine learning to play "Super Mario Bros.," showcasing its ability to understand and navigate dynamic gaming environments without human intervention.


COMPONENTS AND TECHNOLOGIES USED

  • Python

  • OpenAI Gym

  • Proximal Policy Optimization(PPO)

  • Gym-super-mario-bros

  • Nes-py

  • Stable-baselines3


OVERVIEW

Introduction:

The project titled "Reinforcement Learning in Super Mario Bros" is dedicated to harnessing advanced artificial intelligence methodologies to create a self-sufficient agent proficient in playing the iconic video game, Super Mario Bros. Through the application of Reinforcement Learning (RL) techniques, the project endeavors to empower an AI agent to adeptly maneuver the game's intricate landscapes, surmount obstacles, and successfully navigate levels with growing competence. Employing RL algorithms, the agent will assimilate knowledge from its interactions, enabling it to make decisions geared towards maximizing rewards and enhancing its gameplay skills progressively.


Methodology - 

1. Initially, got familiarized with Python language and concepts of OOPS in Python.

2. Introduction to Machine learning(ML) and different Deep Learning Framework.

3. Familiarize with OpenAIGym and Machine learning models like Proximal Policy Optimization (PPO)

4. To apply the above concepts, make a model For Cartpole Game. 

5. After having sufficient knowledge build a mode for Super Mario Bros.

 

Real-life applications:

1. Autonomous Systems: RL is used in autonomous vehicles (self-driving cars, drones) and robots to learn how to navigate complex environments, make decisions, and avoid obstacles.

2. Game Playing: RL algorithms have been used to create game-playing agents that can excel in games like chess, Go, and video games. Notable examples include AlphaGo, which defeated human Go champions.

3. Finance and Trading: RL is used in algorithmic trading to optimize trading strategies, manage portfolios, and make investment decisions based on market conditions and historical data.

 

Problems faced:

To address unexpected errors in the code caused by incompatible library versions.

 

Mentors:

S.N

Name

Branch

Reg. No

1

Rishi Mishra

ECE

20215096

2

Aayush Verma

ECE

20215056

 

Contributors:   

TEAM 1

S.N

Name

Branch

Reg. No.

1

Aryan Kesharwani

ECE

20224035

2

Krishna Gupta

EE

20225045

3

Sahaj Srivastava

CSE

20223222

4

Devansh Sharma

PIE

20227019

5

Aritra Mahara

ME

20226030

6

Chirag

ECE

20224052

7

Harshit Shukla

CSE

20220025

 

TEAM 2

Name

Branch

Registration No.

Ankit Upadhyay

ME

20226023

Garvit Jain

ECE

20224066

Aryan Srivastav

ECE

20224036

Meemansha Singh

ECE

20224097

Shrishti Tomar

ME

20226140

Ayush Mishra

PIE

20227015

 

TEAM 3

 

Ayush Srivastava

ME

20213084

 

 

RL in Super Mario Bros Project Team