训练与评测机器学习与科研 AIK-Dense-AI/claude-scientific-skills训练与评测
ST

Stable Baselines3

维护者 K-Dense Inc. · 最近更新 2026年4月1日

Stable Baselines3 (SB3) is PyTorch-based 库 providing reliable implementations of reinforcement learning algorithms。 This skill provides comprehensive guidance ,用于 training RL agents,creating custom environments,implementing callbacks,、 optimi。

Claude Code训练编排评测比较stable-baselines3machine-learningpackagemachine learning & deep learning

原始来源

K-Dense-AI/claude-scientific-skills

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/stable-baselines3

维护者
K-Dense Inc.
许可
MIT license
最近更新
2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

  • Stable Baselines3 (SB3) is PyTorch-based 库 providing reliable implementations of reinforcement learning algorithms. This skill provides comprehensive guidance ,用于 training RL agents,creating custom environments,implementing callbacks,、 optimizing training workflows ,使用 SB3's unified API。
  • env = gym.make("CartPole-v1")。

原始文档

SKILL.md 摘录

1. Training RL Agents

Basic Training Pattern:

import gymnasium as gym
from stable_baselines3 import PPO

## Initialize agent

model = PPO("MlpPolicy", env, verbose=1)

## Train the agent

model.learn(total_timesteps=10000)

适用场景

  • 可用于standard RL experiments,quick prototyping,、 well-documented algorithm implementations。

不适用场景

  • Do not rely on this catalog entry alone ,用于 installation 或 maintenance details。

相关技能

相关技能

返回目录
BI
训练与评测机器学习与科研 AI

bio-epitranscriptomics-m6anet-analysis

bio-epitranscriptomics-m6anet-analysis:Nanopore direct RNA m6A detection ,支持 m6Anet 深度学习。

OpenClawNanoClaw训练编排
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
训练与评测机器学习与科研 AI

bio-imaging-mass-cytometry-interactive-annotation

bio-imaging-mass-cytometry-interactive-annotation:Interactive cell type annotation ,用于 IMC data。 Covers napari-based ann…

OpenClawNanoClaw训练编排
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
训练与评测机器学习与科研 AI

bio-immunoinformatics-tcr-epitope-binding

bio-immunoinformatics-tcr-epitope-binding:预测 TCR-epitope specificity ,使用 ERGO-II 、 深度学习 models ,用于 T-cell receptor antig…

OpenClawNanoClaw训练编排
FreedomIntelligence/OpenClaw-Medical-Skills查看
CI
训练与评测机器学习与科研 AI

cirq

cirq:Cirq is Google Quantum AI's open-source 框架 ,用于 designing,simulating,、 running quantum circuits on quantum computers…

Claude Code训练编排
K-Dense-AI/claude-scientific-skills查看