Status : Completed
Tags: Python OpenAI Gym Nes-py Stable-baselines3 Gym-super-mario-bros Proximal Policy Optimization(PPO)
Develop an autonomous AI model using machine learning to play "Super Mario Bros.," showcasing its ability to understand and navigate dynamic gaming environments without human intervention.
Python
OpenAI Gym
Proximal Policy Optimization(PPO)
Gym-super-mario-bros
Nes-py
Stable-baselines3
Introduction:
The project titled "Reinforcement Learning in Super Mario Bros" is dedicated to harnessing advanced artificial intelligence methodologies to create a self-sufficient agent proficient in playing the iconic video game, Super Mario Bros. Through the application of Reinforcement Learning (RL) techniques, the project endeavors to empower an AI agent to adeptly maneuver the game's intricate landscapes, surmount obstacles, and successfully navigate levels with growing competence. Employing RL algorithms, the agent will assimilate knowledge from its interactions, enabling it to make decisions geared towards maximizing rewards and enhancing its gameplay skills progressively.
Methodology -
1. Initially, got familiarized with Python language and concepts of OOPS in Python.
2. Introduction to Machine learning(ML) and different Deep Learning Framework.
3. Familiarize with OpenAIGym and Machine learning models like Proximal Policy Optimization (PPO)
4. To apply the above concepts, make a model For Cartpole Game.
5. After having sufficient knowledge build a mode for Super Mario Bros.
Real-life applications:
1. Autonomous Systems: RL is used in autonomous vehicles (self-driving cars, drones) and robots to learn how to navigate complex environments, make decisions, and avoid obstacles.
2. Game Playing: RL algorithms have been used to create game-playing agents that can excel in games like chess, Go, and video games. Notable examples include AlphaGo, which defeated human Go champions.
3. Finance and Trading: RL is used in algorithmic trading to optimize trading strategies, manage portfolios, and make investment decisions based on market conditions and historical data.
Problems faced:
To address unexpected errors in the code caused by incompatible library versions.
Mentors:
S.N |
Name |
Branch |
Reg. No |
1 |
Rishi Mishra |
ECE |
20215096 |
2 |
Aayush Verma |
ECE |
20215056 |
Contributors:
TEAM 1
S.N |
Name |
Branch |
Reg. No. |
1 |
Aryan Kesharwani |
ECE |
20224035 |
2 |
Krishna Gupta |
EE |
20225045 |
3 |
Sahaj Srivastava |
CSE |
20223222 |
4 |
Devansh Sharma |
PIE |
20227019 |
5 |
Aritra Mahara |
ME |
20226030 |
6 |
Chirag |
ECE |
20224052 |
7 |
Harshit Shukla |
CSE |
20220025 |
TEAM 2
Name |
Branch |
Registration No. |
Ankit Upadhyay |
ME |
20226023 |
Garvit Jain |
ECE |
20224066 |
Aryan Srivastav |
ECE |
20224036 |
Meemansha Singh |
ECE |
20224097 |
Shrishti Tomar |
ME |
20226140 |
Ayush Mishra |
PIE |
20227015 |
TEAM 3
Ayush Srivastava |
ME |
20213084 |
RL in Super Mario Bros Project Team