🗺️
map
  • 算法地图
  • math
  • CS
    • Python
    • Linux
    • R
    • C/C++
    • 大数据 SQL
    • API
    • Java
    • Swift
    • 数据结构&算法
    • 云计算
    • 量子计算
    • 前端
    • 小程序
    • 音视频
    • 安全
    • Hack
    • 面试
    • cs工具
    • 其他
  • Algorithm
    • ML
    • NLP
    • CV
    • Audio
    • GNN
    • KG
    • GANs
    • RL
    • 自动驾驶
    • 推荐
    • 搜索
    • 量化
    • 区块链
    • 其他
  • Device
    • GPU
    • TPU
    • Android/IOS
    • 鸿蒙
    • 物联网
  • 科学Sci
    • 物理
    • 化学
    • 生物
    • 医学
    • 工科
      • 通信
        • 科学上网
      • 机械工程
      • 电气工程
        • 电子工程
      • 电力工程
      • 土木工程
      • 环境工程
    • 人文社科
      • 文学
      • 历史
      • 哲学
      • 法律
      • 经济
      • 管理学
      • 社会学
      • 心理学
      • 教育学
      • 会计
      • 职场
      • 传统
    • 地理
    • 艺术
      • 美术
      • 摄影
      • 音乐
      • 舞蹈
      • 体育
    • 工具
    • Ai工具
    • 其他
  • other
  • 赞助本站
  • View on Github
  • Star History
由 GitBook 提供支持
在本页
  • 平台
  • OpenAI
  • UP
  • 课程
  • 算法
  • 具身智能 Embodied AI
  • 游戏AI
  • 其他

这有帮助吗?

  1. Algorithm

RL

强化学习 让我以另一种方式玩游戏吧

上一页GANs下一页自动驾驶

最后更新于5个月前

这有帮助吗?

DeepMind Made A Superhuman AI For 57 Atari Games! 🕹

A.I. Flappy Bird without Libraries from SCRATCH (Python/PyCharm)

平台

OpenAI

UP

课程

Using AI to accelerate scientific discovery - Demis Hassabis (Crick Insight Lecture Series)

CS234: Reinforcement Learning

Reinforcement Learning - David Silver

What is the Statistical Complexity of Reinforcement Learning?

强化学习的统计复杂性

Reinforcement Learning in 3 Hours

[Tutorialsplanet.NET] Udemy - Advanced AI Deep Reinforcement Learning in Python

[Tutorialsplanet.NET] Udemy - Artificial Intelligence Reinforcement Learning in Python

[Tutorialsplanet.NET] Udemy - Artificial Intelligence Reinforcement Learning in Python

强化学习是一种机器学习的类型,涉及代理通过反复试验来学习如何在环境中做出决策。代理的目标是最大化由环境给出的奖励信号。代理学习采取导致最大可能奖励的行动,同时避免导致负面结果的行动。

Richard S. Sutton和Andrew G. Barto的《强化学习导论》一书全面介绍了强化学习领域。该书涵盖价值函数、蒙特卡罗方法、时序差分学习和策略梯度等主题。

该书的第一版于1998年出版,第二版目前正在编写中。第二版根据领域内最新进展更新了材料,并增加了有关深度强化学习和多智能体强化学习的新章节。

该书被广泛认为是关于强化学习的最权威的文本之一,并被该领域的研究人员和实践者用作参考。它适合本科和研究生学生,并为任何对学习或从事强化学习感兴趣的人提供了坚实的基础。

算法

DQN
RLHF

PPO

Evolution

具身智能 Embodied AI

游戏AI

Voyager

其他

俄罗斯方块Tetris AI Learns to Play Tetris [Cocos Creator/TypeScript] Archi Tsai

AirSim

AirSim是由微软开发的一个开源的模拟器,用于模拟无人机、汽车和机器人等各种类型的机器人的行为和环境。它提供了高度可定制的环境,允许用户在虚拟场景中测试各种机器人算法,包括视觉SLAM、路径规划、控制等等。

AirSim的最大特点是其高度逼真的图形渲染引擎和物理模拟引擎。它使用了虚幻引擎作为渲染引擎,并使用了现代计算机图形学技术来模拟各种物理现象,例如惯性、空气阻力、摩擦力等等,以使得机器人在仿真环境中的行为和现实世界中的行为尽量相似。

AirSim还提供了一套API,使得用户可以轻松地控制和监测机器人的状态。这些API可以用C++、Python和ROS等语言和框架进行访问。

总之,AirSim为机器人研究和开发人员提供了一个快速、高效、低成本的测试平台,可以加速机器人技术的发展。

rl + 无人机

腾讯开悟(, )

Reinforcement Learning(, )

OpenAI(, , , ) openai Research index

OpenAI CEO, CTO on risks and how AI will reshape society

Sam Altman: OpenAI CEO on GPT-4, ChatGPT, and the Future of AI | Podcast #367

Breakthrough potential of AI | Sam Altman | MIT 2023

OpenAI CEO Sam Altman testifies at Senate artificial intelligence hearing | full video

LIVE: OpenAI CEO Sam Altman testifies during Senate hearing on AI oversight — 05/16/23

Open AI CEO第一次国会听证会内容介绍

【OpenAI】萨姆奥特曼 Sam Altman出席国会听证会 | 积极拥抱政府监管 | AI企业要上牌照 | 建议成立国际组织 | AI将创造更多就业 | 不为赚钱只因热爱

Yuxi Li(, )

Code Bullet(, )

ClarityCoders

Greer Viau(, )

DeepMind

Machine Learning with Phil

Lex Fridman

蓝仔的十八般武艺 抖音号:

Shusen Wang

AI探长 抖音号:

Yifei Hu

Emergent Garden

学渣程序员 抖音号:

Edan Meyer

AI Prism

Saasha Nair

Tien-Lung Sun

Pourquoi (布瓜的世界)

强化学习基础(本科课程)-北京邮电大学

秒懂强化学习 Reinforcement Learning

强化学习 Reinforcement Learning Python 教学 教程

什么是深度强化学习(DRL)?【知多少】

什么是强化学习(Reinforcement Learning)?【知多少】

在Unity環境中訓練強化學習AI!

Tim & Heinrich — Democraticizing Reinforcement Learning Research

Train AI to Play Snake – Reinforcement Learning Course (Python, PyTorch, Pygame)

Reinforcement learning with Snake-RL - Made with TensorFlow.js

Algorithmic SNAKES! (AI compilation)

How does electricity find the "Path of Least Resistance"?

贪吃蛇游戏数学算法人工智能AI创造世界纪录 哈密尔顿回路

代码编程 华容道

分步详解C语言贪吃蛇游戏

【Python】60行搞定贪吃蛇小游戏

【python游戏编程教程】【小白友好版】贪吃蛇 五子棋 三子棋 联机

我用30天写了一个完美的贪吃蛇AI

MIT 6.S191: Reinforcement Learning

AI Learns to Play

Deep Maths - machine learning and mathematics

DeepMind x UCL | Reinforcement Learning Course 2018

Alpha Go

CS885 Reinforcement Learning - Spring 2020

CS885 Reinforcement Learning - Spring 2018 - University of Waterloo

深度强化学习完整版-2020秋-UC Berkeley CS285 by Sergey Levine

Reinforcement Learning with Python()

A.I. Learns to play Flappy Bird()

AI Learns to play...

AI is programmed to play...

AI Plays Flappy Bird - NEAT Python

Python Pong AI Tutorial - Using NEAT

Greer Viau

Reinforcement Learning

Reinforcement Learning - Goal Oriented Intelligence

Reinforcement Learning - Developing Intelligent Agents

MarI/O - Machine Learning for Video Games

MarIQ -- Q-Learning Neural Network for Mario Kart -- 2M Sub Special

Reinforcement Learning by David Silver

Reinforcement Learning - Emma Brunskill | Stanford - OnlineHub

reinforcement learning Matlab

秒懂强化学习 Reinforcement Learning

强化学习基础(张志华)-北京大学

深度强化学习基础

决胜AI-强化学习实战系列视频课程 唐宇迪

讓人工智慧玩捉迷藏,最後居然發展出連人類都想不到的策略!? | 一探啾竟 第80集 |

OpenAI Plays Hide and Seek…and Breaks The Game! 🤖

這是我看過最廢的人工智慧了...

CS 294-112 Deep Reinforcement Learning UC Berkeley

Python Bots Playing Games and More!!

A.I. Battles

Python Reinforcement Learning using Gymnasium – Full Course

Reinforcement Learning Course: Intro to Advanced Actor Critic Methods

Reinforcement Learning Course - Full Machine Learning Tutorial

Python AI Learns to Play the Chrome Dinosaur Game | Made with Pygame and NEAT

Build a Chrome Dino Game AI Model with Python | AI Learns to Play Dino Game

Python A.I. (N.E.A.T.)

Chrome Dinosaur in Pygame

Pygame Tutorials

Python AI Learns To Play Flappy Bird! | Python NEAT and Pygame

Flappy Bird Tutorial

Intro to Reinforcement Learning 强化学习纲要

Reinforcement Learning

Reinforcement Learning with Stable Baselines 3

Physics Simulator w/ Robot Dog

Starcraft 2 AI

永不坠落的小鸟—游戏中的人工智能

An introduction to Reinforcement Learning

Reinforcement Learning with sparse rewards

An introduction to Policy Gradient methods - Deep Reinforcement Learning

Learning to Walk via Deep Reinforcement Learning

Equivariant RL

AI Learns To Draw New Pokemon

Making My First Machine Learning Game

Advanced Topics in Reinforcement Learning

This AI Learned Boxing…With Serious Knockout Power! 🥊

Control Strategies for Physically Simulated Characters Performing Two-player Competitive Sports

Deep Reinforcement Learning in Python Tutorial - A Course on How to Implement Deep Learning Papers

Q Learning In Reinforcement Learning | Q Learning Example | Machine Learning Tutorial |

Artificial Intelligence Lessons

Reinforcement Learning

Reinforcement learning with TensorFlow Agents

TensorFlow and deep reinforcement learning, without a PhD (Google I/O '18)

The fastest matrix multiplication algorithm

Deep Reinforcement Learning: CS 285 Fall 2021 (UC Berkeley)

Deep Reinforcement Learning: CS 285 Fall 2020

David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

AI's Game Playing Challenge -

Google's Deep Mind Explained! - Self Learning A.I.

Teach AI To Play Snake! Reinforcement Learning With PyTorch and Pygame

Download Practical AI with Python and Reinforcement Learning

NVIDIA’s New AI Trained For 10 Years! But How? 🤺

NVIDIA’s AI Plays Minecraft After 33 Years of Training! 🤖

DeepMind Makes Prototyping Papers Easy with ACME

Deep Reinforcement Learning Tutorials - All Videos

Advanced Actor Critic and Policy Gradient Methods

Learning RL Algorithms via ML

Research Talk: Dueling network architectures for deep reinforcement learning

Tutorial - Search Solutions 2020 - IRSG

혁펜하임의 “트이는” 강화 학습 (Reinforcement learning)

Code Frozen Game Using Reinforcement Learning | OpenAI Gym | Python Project

Creating binance trading bot GUI | Python | Live trading

Fundamentals of Reinforcement Learning

深度強化學習簡介 (Deep Reinforcement Learning)

Taipei Tech Deep Reinforcement Learning

Ubisoft’s New AI: Breathing Life Into Games!

Superintelligence: Science or Fiction? | Elon Musk & Other Great Minds

Reinforcement Learning Fundamentals

Reinforcement Learning

Reinforcement Learning

Data-driven Optimization Workshop: Deep Reinforcement Learning in Supply Chain Optimizations

【强化学习的数学原理】课程视频合集(从零开始透彻理解强化学习)

Talk | 悉尼科技大学在读博士生胡思逸:MARLlib,全新的多智能体强化学习框架

Reinforcement Learning for Simple UAV Navigation

Reinforcement Learning: An Introduction stanford Second edition, in progress

Learning From Passive Data Explained

Playing Atari with Deep Reinforcement Learning 2013.12

Reinforcement Learning - Ep. 30 (Deep Learning SIMPLIFIED)

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Reinforcement Learning from Human Feedback: From Zero to chatGPT

John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges

How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF)

RLHF+CHATGPT: What you must know

AI Safety, RLHF, and Self-Supervision - Jared Kaplan | Stanford MLSys #79

【分享】State of GPT(GPT的现状)中文字幕精校版 | Andrej Karpathy 微软Build大会精彩演讲 | GPT状态和原理 | 解密OpenAI模型训练

【機器學習 2023】(生成式 AI)

RL — Proximal Policy Optimization (PPO) Explained

强化学习与ChatGPT:PPO 算法介绍和实际应用(中文介绍)

Python Reinforcement Learning using Stable baselines. Mario PPO

Google AI Simulates Evolution On A Computer! 🦖

【人工智能】具身智能:下一个AI浪潮 | 稚晖君 | Embodied AI | 什么是具身智能 | 目前发展阶段 | 挑战与困难 | 智元远征A1机器人

【人工智能】全新AI智能体Voyager | 自己学会玩minecraft | 全场景终身学习 | 性能完胜AutoGPT | 英伟达Nvidia最新发布 | NPC取代人类玩家 无梯度架构 终身学习

MineDojo/

NVIDIA’s New AI Mastered Minecraft 15X Faster!

DQN_HollowKnight(, , )

快手斗地主 DouZero(, , , , , , )

俄羅斯方塊已死...? 2022世界大賽到底發生了什麼事?

Coding Adventure: Chess AI

How To Hack The Google Chrome Dinosaur Game [PYTHON] | Only 10 Lines Of Coding | Pyautogui | Numpy

Deep Reinforcement Learning in Python Tutorial 

AI's Game Playing Challenge -

AlphaStar: The inside story

David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

AlphaZero from Scratch – Machine Learning Tutorial

The story of AlphaGo

AlphaGo full movie HD

阿尔法狗用什么算法击败李世石?《阿尔法围棋》 |

DeepMind AlphaStar Analysis and Impressions (StarCraft II)

StarCraft 2: Google DeepMind AlphaStar (A.I.) vs Pro Gamer!

Reinforcement Learning for Stock Prediction

DeepMind's New AI: As Smart As An Engineer... Kind Of! 🤯

Artificial Intelligence & Machine Learning

Coding Challenge #71: Minesweeper

How to play Minesweeper

Python Game Development Project Using OOP – Minesweeper Tutorial (w/ Tkinter)

蘑菇书 Easy RL 强化学习教程 datawhalechina/

alibaba/

Reinforcement Learning in 3 Hours | Full Course using Python

Discovering novel algorithms with AlphaTensor

Deepmind AlphaTensor Algorithmic Discovery with AI | Paper + Code

【線性代數 2022 (課程補充)】AlphaTensor: 用增強式學習 (Reinforcement Learning) 找出更有效率的矩陣相乘演算法

DRL, Deep Reinforcement Learning, 2018

ML Lecture 23-1: Deep Reinforcement Learning

Machine Learning (Hung-yi Lee, NTU)

This is a game changer! (AlphaTensor by DeepMind explained)

AlphaFold 2 论文精读【论文精读】

Deep Reinforcement Learning with OpenAI Gym in Python

格斗之王!AI写出来的AI竟然这么强!

DeepMind’s AI Athletes Play In The Real World! colab

深度强化学习训练智能体:超级玛丽
DQN in Pytorch Stream 3 of N | Atari Breakout + Logging and Monitoring
Two Minute Papers
Python Flappy Bird AI Tutorial (with NEAT) - Creating the Bird
Max Teaches Tech
How to Solve a Basic Reinforcement Learning Example | RL Hello World
An introduction to Reinforcement Learning
s
quora
TutsNode.com
site
git
Baselines
GPT-3
s
ABC News
Lex Fridman
Imagination in Action
CBS News
CNBC Television
Jeff科技视角
最佳拍档
u
u
u
u
u
u
u
lanzai8888
u
en
AITanzhang
u
u
67129424878
u
u
u
u
u
刘先生
莫烦Python
莫烦Python
KnowingAI知智
KnowingAI知智
AI葵
Weights & Biases
freeCodeCamp
TensorFlow
AlphaPhoenix
AlphaPhoenix
Oziter茅
Oziter茅
大雄的公开课
Bennett Poitier
Stephanie_程序媛
林亦LYi
Alexander Amini
list
Normalized Nerd
Oxford Mathematics
DeepMind
DeepMind
weibin zhuang
Pascal Poupart
Pascal Poupart
Math4AI
Nicholas Renotte
Code Bullet
Code Bullet
Code Bullet
Tech With Tim
Tech With Tim
Neural Network Learns to Play Snake
Yannic Kilcher
RE•WORK
NPTEL-NOC IITM
deeplizard
deeplizard
SethBling
SethBling
道法自然
Rahul Madhavan
Raony Maia Fontes
莫烦Python
刘先生
Shusen Wang
网易云课堂
啾啾鞋
Two Minute Papers
啾啾鞋
coursehero
eecs
CAL ESG - EECS
reddit
ClarityCoders
ClarityCoders
freeCodeCamp
freeCodeCamp
freeCodeCamp
enigma
git
Nicholas Renotte
Max teaches Tech
Max teaches Tech
Max teaches Tech
enigma
git
Max teaches Tech
Bolei Zhou
git
sentdex
sentdex
sentdex
sentdex
开发者学堂
Arxiv Insights
Arxiv Insights
Arxiv Insights
Jie Tan
Simons Institute
Simons Institute
Jabrils
Jabrils
DeepPavlov
Two Minute Papers
Meta Research
freeCodeCamp
Simplilearn
Dr. Daniel Soper
Steve Brunton
TensorFlow
TensorFlow
Dr. Trefor Bazett
RAIL
RAIL
Lex Fridman
Computerphile
ColdFusion
Python Engineer
tut4dev
Two Minute Papers
Two Minute Papers
Machine Learning with Phil
Machine Learning with Phil
Machine Learning with Phil
Edan Meyer
Stanford Scholar
BCS Member Groups
혁펜하임
AI Sciences
AI Sciences
AI Sciences
Kuan-Ting Lai
Kuan-Ting Lai
Two Minute Papers
Future of Life Institute
Mutual Information
AI Insights - Rituraj Kaushik
Krish Naik
Microsoft Research
Aerial robotics @ Westlake University
将门-TechBeat技术社区
Huy Pham
pdf
Edan Meyer
arxiv
pdf
pdf
DeepLearning.TV
Machine Learning Street Talk
HuggingFace
Berkeley EECS
John Tan Chong Min
Machine Learning Street Talk
Stanford MLSys Seminars
最佳拍档
Hung-yi Lee
medium
Pourquoi (布瓜的世界)
ClarityCoders
Two Minute Papers
最佳拍档
最佳拍档
arxiv
Voyager
Two Minute Papers
git
v
arxiv
git
s
reddit
paperswithcode
dczha
v
啾啾鞋
Sebastian Lague
Know-How
freeCodeCamp
Computerphile
DeepMind
Lex Fridman
freeCodeCamp
DeepMind
Zucci
看电影了没
brownbear
LowkoTV
Siraj Raval
Two Minute Papers
ForrestKnight
The Coding Train
Eric Buffington
freeCodeCamp
s
epubit
easy-rl
errata
db
EasyReinforcementLearning
Nicholas Renotte
deepmind
v
Simon Lermen AI
Hung-yi Lee
Hung-yi Lee
Hung-yi Lee
Hung-yi Lee
Yannic Kilcher
Mu Li
NeuralNine
林亦LYi
Two Minute Papers
16MB
Bilgin E. Mastering Reinforcement Learning with Python 2021.pdf
pdf