RL

强化学习 让我以另一种方式玩游戏吧

深度强化学习训练智能体:超级玛丽arrow-up-right

DQN in Pytorch Stream 3 of N | Atari Breakout + Logging and Monitoringarrow-up-right

DeepMind Made A Superhuman AI For 57 Atari Games! 🕹 Two Minute Papersarrow-up-right

Python Flappy Bird AI Tutorial (with NEAT) - Creating the Birdarrow-up-right

A.I. Flappy Bird without Libraries from SCRATCH (Python/PyCharm) Max Teaches Techarrow-up-right

How to Solve a Basic Reinforcement Learning Example | RL Hello Worldarrow-up-right

An introduction to Reinforcement Learningarrow-up-right

平台

腾讯开悟(sarrow-up-right, )

Reinforcement Learning(quoraarrow-up-right, )

OpenAI

OpenAI CEO, CTO on risks and how AI will reshape society ABC Newsarrow-up-right

Sam Altman: OpenAI CEO on GPT-4, ChatGPT, and the Future of AI | Lex Fridmanarrow-up-right Podcast #367

Breakthrough potential of AI | Sam Altman | MIT 2023 Imagination in Actionarrow-up-right

OpenAI CEO Sam Altman testifies at Senate artificial intelligence hearing | full video CBS Newsarrow-up-right

LIVE: OpenAI CEO Sam Altman testifies during Senate hearing on AI oversight — 05/16/23 CNBC Televisionarrow-up-right

Open AI CEO第一次国会听证会内容介绍 Jeff科技视角arrow-up-right

【OpenAI】萨姆奥特曼 Sam Altman出席国会听证会 | 积极拥抱政府监管 | AI企业要上牌照 | 建议成立国际组织 | AI将创造更多就业 | 不为赚钱只因热爱 最佳拍档arrow-up-right

UP

Yuxi Li(uarrow-up-right, )

Code Bullet(uarrow-up-right, )

ClarityCoders uarrow-up-right

Greer Viau(uarrow-up-right, )

Machine Learning with Phil uarrow-up-right

Lex Fridman uarrow-up-right

蓝仔的十八般武艺 抖音号: lanzai8888arrow-up-right

AI探长 抖音号: AITanzhangarrow-up-right

Emergent Garden uarrow-up-right

学渣程序员 抖音号:67129424878arrow-up-right

Edan Meyer uarrow-up-right

Saasha Nair uarrow-up-right

Tien-Lung Sun uarrow-up-right

Pourquoi (布瓜的世界) uarrow-up-right

课程

强化学习基础(本科课程)-北京邮电大学 刘先生arrow-up-right

秒懂强化学习 Reinforcement Learning 莫烦Pythonarrow-up-right

强化学习 Reinforcement Learning Python 教学 教程 莫烦Pythonarrow-up-right

什么是深度强化学习(DRL)?【知多少】 KnowingAI知智arrow-up-right

什么是强化学习(Reinforcement Learning)?【知多少】KnowingAI知智arrow-up-right

在Unity環境中訓練強化學習AI! AI葵arrow-up-right

Tim & Heinrich — Democraticizing Reinforcement Learning Research Weights & Biasesarrow-up-right

Train AI to Play Snake – Reinforcement Learning Course (Python, PyTorch, Pygame) freeCodeCamparrow-up-right

Reinforcement learning with Snake-RL - Made with TensorFlow.js TensorFlowarrow-up-right

Algorithmic SNAKES! (AI compilation) AlphaPhoenixarrow-up-right

How does electricity find the "Path of Least Resistance"? AlphaPhoenixarrow-up-right

贪吃蛇游戏数学算法人工智能AI创造世界纪录 Oziter茅arrow-up-right 哈密尔顿回路

代码编程 Oziter茅arrow-up-right 华容道

分步详解C语言贪吃蛇游戏 大雄的公开课arrow-up-right

【Python】60行搞定贪吃蛇小游戏 Bennett Poitierarrow-up-right

【python游戏编程教程】【小白友好版】贪吃蛇 Stephanie_程序媛arrow-up-right 五子棋 三子棋 联机

我用30天写了一个完美的贪吃蛇AI 林亦LYiarrow-up-right

MIT 6.S191: Reinforcement Learning Alexander Aminiarrow-up-right listarrow-up-right

Deep Maths - machine learning and mathematics Oxford Mathematicsarrow-up-right

Using AI to accelerate scientific discovery - Demis Hassabis (Crick Insight Lecture Series)

DeepMindarrow-up-right

DeepMind x UCL | Reinforcement Learning Course 2018 DeepMindarrow-up-right

CS885 Reinforcement Learning - Spring 2020 Pascal Poupartarrow-up-right

CS885 Reinforcement Learning - Spring 2018 - University of Waterloo Pascal Poupartarrow-up-right

CS234: Reinforcement Learning

深度强化学习完整版-2020秋-UC Berkeley CS285 by Sergey Levine Math4AIarrow-up-right

Reinforcement Learning with Python(Nicholas Renottearrow-up-right)

A.I. Learns to play Flappy Bird(Code Bulletarrow-up-right)

AI Learns to play... Code Bulletarrow-up-right

AI is programmed to play... Code Bulletarrow-up-right

AI Plays Flappy Bird - NEAT Python Tech With Timarrow-up-right

Python Pong AI Tutorial - Using NEAT Tech With Timarrow-up-right

Reinforcement Learning - Goal Oriented Intelligence deeplizardarrow-up-right

Reinforcement Learning - Developing Intelligent Agents deeplizardarrow-up-right

MarI/O - Machine Learning for Video Games SethBlingarrow-up-right

MarIQ -- Q-Learning Neural Network for Mario Kart -- 2M Sub Special SethBlingarrow-up-right

Reinforcement Learning - David Silver

Reinforcement Learning by David Silver 道法自然arrow-up-right

Reinforcement Learning - Emma Brunskill | Stanford - OnlineHub Rahul Madhavanarrow-up-right

reinforcement learning Matlab Raony Maia Fontesarrow-up-right

秒懂强化学习 Reinforcement Learning 莫烦Pythonarrow-up-right

强化学习基础(张志华)-北京大学 刘先生arrow-up-right

深度强化学习基础 Shusen Wangarrow-up-right

决胜AI-强化学习实战系列视频课程 唐宇迪 网易云课堂arrow-up-right

讓人工智慧玩捉迷藏,最後居然發展出連人類都想不到的策略!? | 一探啾竟 第80集 | 啾啾鞋arrow-up-right

OpenAI Plays Hide and Seek…and Breaks The Game! 🤖 Two Minute Papersarrow-up-right

這是我看過最廢的人工智慧了... 啾啾鞋arrow-up-right

Python Bots Playing Games and More!! ClarityCodersarrow-up-right

Python Reinforcement Learning using Gymnasium – Full Course freeCodeCamparrow-up-right

Reinforcement Learning Course: Intro to Advanced Actor Critic Methods freeCodeCamparrow-up-right

Reinforcement Learning Course - Full Machine Learning Tutorial freeCodeCamparrow-up-right

Python AI Learns to Play the Chrome Dinosaur Game | Made with Pygame and NEAT enigmaarrow-up-right gitarrow-up-right

Build a Chrome Dino Game AI Model with Python | AI Learns to Play Dino Game Nicholas Renottearrow-up-right

Python A.I. (N.E.A.T.) Max teaches Techarrow-up-right

Chrome Dinosaur in Pygame Max teaches Techarrow-up-right

Pygame Tutorials Max teaches Techarrow-up-right

Python AI Learns To Play Flappy Bird! | Python NEAT and Pygame enigmaarrow-up-right gitarrow-up-right

Flappy Bird Tutorial Max teaches Techarrow-up-right

Intro to Reinforcement Learning 强化学习纲要 Bolei Zhouarrow-up-right gitarrow-up-right

Reinforcement Learning sentdexarrow-up-right

Reinforcement Learning with Stable Baselines 3 sentdexarrow-up-right

Physics Simulator w/ Robot Dog sentdexarrow-up-right

Starcraft 2 AI sentdexarrow-up-right

永不坠落的小鸟—游戏中的人工智能 开发者学堂arrow-up-right

An introduction to Reinforcement Learning Arxiv Insightsarrow-up-right

Reinforcement Learning with sparse rewards Arxiv Insightsarrow-up-right

An introduction to Policy Gradient methods - Deep Reinforcement Learning Arxiv Insightsarrow-up-right

Learning to Walk via Deep Reinforcement Learning Jie Tanarrow-up-right

What is the Statistical Complexity of Reinforcement Learning?

强化学习的统计复杂性

Simons Institutearrow-up-right

AI Learns To Draw New Pokemon Jabrilsarrow-up-right

Making My First Machine Learning Game Jabrilsarrow-up-right

Advanced Topics in Reinforcement Learning DeepPavlovarrow-up-right

This AI Learned Boxing…With Serious Knockout Power! 🥊 Two Minute Papersarrow-up-right

Control Strategies for Physically Simulated Characters Performing Two-player Competitive Sports Meta Researcharrow-up-right

Deep Reinforcement Learning in Python Tutorial - A Course on How to Implement Deep Learning Papers freeCodeCamparrow-up-right

Q Learning In Reinforcement Learning | Q Learning Example | Machine Learning Tutorial | Simplilearnarrow-up-right

Artificial Intelligence Lessons Dr. Daniel Soperarrow-up-right

Reinforcement Learning Steve Bruntonarrow-up-right

Reinforcement learning with TensorFlow Agents TensorFlowarrow-up-right

TensorFlow and deep reinforcement learning, without a PhD (Google I/O '18) TensorFlowarrow-up-right

The fastest matrix multiplication algorithm Dr. Trefor Bazettarrow-up-right

Deep Reinforcement Learning: CS 285 Fall 2021 (UC Berkeley) RAILarrow-up-right

Deep Reinforcement Learning: CS 285 Fall 2020 RAILarrow-up-right

David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86 Lex Fridmanarrow-up-right

AI's Game Playing Challenge - Computerphilearrow-up-right

Google's Deep Mind Explained! - Self Learning A.I. ColdFusionarrow-up-right

Teach AI To Play Snake! Reinforcement Learning With PyTorch and Pygame Python Engineerarrow-up-right

Download Practical AI with Python and Reinforcement Learning tut4devarrow-up-right

NVIDIA’s New AI Trained For 10 Years! But How? 🤺 Two Minute Papersarrow-up-right

NVIDIA’s AI Plays Minecraft After 33 Years of Training! 🤖 Two Minute Papersarrow-up-right

DeepMind Makes Prototyping Papers Easy with ACME Machine Learning with Philarrow-up-right

Deep Reinforcement Learning Tutorials - All Videos Machine Learning with Philarrow-up-right

Advanced Actor Critic and Policy Gradient Methods Machine Learning with Philarrow-up-right

Learning RL Algorithms via ML Edan Meyerarrow-up-right

Research Talk: Dueling network architectures for deep reinforcement learning Stanford Scholararrow-up-right

Tutorial - Search Solutions 2020 - IRSG BCS Member Groupsarrow-up-right

혁펜하임의 “트이는” 강화 학습 (Reinforcement learning) 혁펜하임arrow-up-right

Code Frozen Game Using Reinforcement Learning | OpenAI Gym | Python Project AI Sciencesarrow-up-right

Creating binance trading bot GUI | Python | Live trading AI Sciencesarrow-up-right

Fundamentals of Reinforcement Learning AI Sciencesarrow-up-right

深度強化學習簡介 (Deep Reinforcement Learning) Kuan-Ting Laiarrow-up-right

Taipei Tech Deep Reinforcement Learning Kuan-Ting Laiarrow-up-right

Ubisoft’s New AI: Breathing Life Into Games! Two Minute Papersarrow-up-right

Superintelligence: Science or Fiction? | Elon Musk & Other Great Minds Future of Life Institutearrow-up-right

Reinforcement Learning in 3 Hours

Reinforcement Learning Fundamentals Mutual Informationarrow-up-right

[Tutorialsplanet.NET] Udemy - Advanced AI Deep Reinforcement Learning in Python

[Tutorialsplanet.NET] Udemy - Artificial Intelligence Reinforcement Learning in Python

[Tutorialsplanet.NET] Udemy - Artificial Intelligence Reinforcement Learning in Python

Reinforcement Learning Krish Naikarrow-up-right

Data-driven Optimization Workshop: Deep Reinforcement Learning in Supply Chain Optimizations Microsoft Researcharrow-up-right

【强化学习的数学原理】课程视频合集(从零开始透彻理解强化学习)Aerial robotics @ Westlake Universityarrow-up-right

Talk | 悉尼科技大学在读博士生胡思逸:MARLlib,全新的多智能体强化学习框架 将门-TechBeat技术社区arrow-up-right

Reinforcement Learning for Simple UAV Navigation Huy Phamarrow-up-right

Reinforcement Learning: An Introduction pdfarrow-up-right stanford Second edition, in progress

强化学习是一种机器学习的类型,涉及代理通过反复试验来学习如何在环境中做出决策。代理的目标是最大化由环境给出的奖励信号。代理学习采取导致最大可能奖励的行动,同时避免导致负面结果的行动。

Richard S. Sutton和Andrew G. Barto的《强化学习导论》一书全面介绍了强化学习领域。该书涵盖价值函数、蒙特卡罗方法、时序差分学习和策略梯度等主题。

该书的第一版于1998年出版,第二版目前正在编写中。第二版根据领域内最新进展更新了材料,并增加了有关深度强化学习和多智能体强化学习的新章节。

该书被广泛认为是关于强化学习的最权威的文本之一,并被该领域的研究人员和实践者用作参考。它适合本科和研究生学生,并为任何对学习或从事强化学习感兴趣的人提供了坚实的基础。

Learning From Passive Data Explained Edan Meyerarrow-up-right

算法

DQN

Playing Atari with Deep Reinforcement Learning arxivarrow-up-right pdfarrow-up-right pdfarrow-up-right 2013.12

Reinforcement Learning - Ep. 30 (Deep Learning SIMPLIFIED) DeepLearning.TVarrow-up-right

CURL: Contrastive Unsupervised Representations for Reinforcement Learning Machine Learning Street Talkarrow-up-right

RLHF

Reinforcement Learning from Human Feedback: From Zero to chatGPT HuggingFacearrow-up-right

John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges Berkeley EECSarrow-up-right

How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF) John Tan Chong Minarrow-up-right

RLHF+CHATGPT: What you must know Machine Learning Street Talkarrow-up-right

AI Safety, RLHF, and Self-Supervision - Jared Kaplan | Stanford MLSys #79 Stanford MLSys Seminarsarrow-up-right

【分享】State of GPT(GPT的现状)中文字幕精校版 | Andrej Karpathy 微软Build大会精彩演讲 | GPT状态和原理 | 解密OpenAI模型训练 最佳拍档arrow-up-right

【機器學習 2023】(生成式 AI) Hung-yi Leearrow-up-right

PPO

RL — Proximal Policy Optimization (PPO) Explained mediumarrow-up-right

强化学习与ChatGPT:PPO 算法介绍和实际应用(中文介绍) Pourquoi (布瓜的世界)arrow-up-right

Python Reinforcement Learning using Stable baselines. Mario PPO ClarityCodersarrow-up-right

Evolution

Google AI Simulates Evolution On A Computer! 🦖 Two Minute Papersarrow-up-right

具身智能 Embodied AI

【人工智能】具身智能:下一个AI浪潮 | 稚晖君 | Embodied AI | 什么是具身智能 | 目前发展阶段 | 挑战与困难 | 智元远征A1机器人 最佳拍档arrow-up-right

游戏AI

Voyager

【人工智能】全新AI智能体Voyager | 自己学会玩minecraft | 全场景终身学习 | 性能完胜AutoGPT | 英伟达Nvidia最新发布 | NPC取代人类玩家 最佳拍档arrow-up-right 无梯度架构 终身学习

NVIDIA’s New AI Mastered Minecraft 15X Faster! Two Minute Papersarrow-up-right

其他

DQN_HollowKnight(gitarrow-up-right, varrow-up-right, )

俄罗斯方块Tetris AI Learns to Play Tetris [Cocos Creator/TypeScript] Archi Tsai

俄羅斯方塊已死...? 2022世界大賽到底發生了什麼事? 啾啾鞋arrow-up-right

Coding Adventure: Chess AI Sebastian Laguearrow-up-right

How To Hack The Google Chrome Dinosaur Game [PYTHON] | Only 10 Lines Of Coding | Pyautogui | Numpy Know-Howarrow-up-right

Deep Reinforcement Learning in Python Tutorial freeCodeCamparrow-up-right

AI's Game Playing Challenge - Computerphilearrow-up-right

AlphaStar: The inside story DeepMindarrow-up-right

David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86 Lex Fridmanarrow-up-right

AlphaZero from Scratch – Machine Learning Tutorial freeCodeCamparrow-up-right

The story of AlphaGo DeepMindarrow-up-right

AlphaGo full movie HD Zucciarrow-up-right

阿尔法狗用什么算法击败李世石?《阿尔法围棋》 | 看电影了没arrow-up-right

DeepMind AlphaStar Analysis and Impressions (StarCraft II) brownbeararrow-up-right

StarCraft 2: Google DeepMind AlphaStar (A.I.) vs Pro Gamer! LowkoTVarrow-up-right

Reinforcement Learning for Stock Prediction Siraj Ravalarrow-up-right

DeepMind's New AI: As Smart As An Engineer... Kind Of! 🤯 Two Minute Papersarrow-up-right

Artificial Intelligence & Machine Learning ForrestKnightarrow-up-right

Coding Challenge #71: Minesweeper The Coding Trainarrow-up-right

How to play Minesweeper Eric Buffingtonarrow-up-right

Python Game Development Project Using OOP – Minesweeper Tutorial (w/ Tkinter) freeCodeCamparrow-up-right

Reinforcement Learning in 3 Hours | Full Course using Python Nicholas Renottearrow-up-right

Discovering novel algorithms with AlphaTensor deepmindarrow-up-right varrow-up-right

Deepmind AlphaTensor Algorithmic Discovery with AI | Paper + Code Simon Lermen AIarrow-up-right

【線性代數 2022 (課程補充)】AlphaTensor: 用增強式學習 (Reinforcement Learning) 找出更有效率的矩陣相乘演算法 Hung-yi Leearrow-up-right

DRL, Deep Reinforcement Learning, 2018 Hung-yi Leearrow-up-right

ML Lecture 23-1: Deep Reinforcement Learning Hung-yi Leearrow-up-right

Machine Learning (Hung-yi Lee, NTU) Hung-yi Leearrow-up-right

This is a game changer! (AlphaTensor by DeepMind explained) Yannic Kilcherarrow-up-right

AlphaFold 2 论文精读【论文精读】 Mu Liarrow-up-right

Deep Reinforcement Learning with OpenAI Gym in Python NeuralNinearrow-up-right

格斗之王!AI写出来的AI竟然这么强! 林亦LYiarrow-up-right

DeepMind’s AI Athletes Play In The Real World! Two Minute Papersarrow-up-right colab

AirSim

AirSim是由微软开发的一个开源的模拟器,用于模拟无人机、汽车和机器人等各种类型的机器人的行为和环境。它提供了高度可定制的环境,允许用户在虚拟场景中测试各种机器人算法,包括视觉SLAM、路径规划、控制等等。

AirSim的最大特点是其高度逼真的图形渲染引擎和物理模拟引擎。它使用了虚幻引擎作为渲染引擎,并使用了现代计算机图形学技术来模拟各种物理现象,例如惯性、空气阻力、摩擦力等等,以使得机器人在仿真环境中的行为和现实世界中的行为尽量相似。

AirSim还提供了一套API,使得用户可以轻松地控制和监测机器人的状态。这些API可以用C++、Python和ROS等语言和框架进行访问。

总之,AirSim为机器人研究和开发人员提供了一个快速、高效、低成本的测试平台,可以加速机器人技术的发展。

rl + 无人机

最后更新于