Training & EvalMachine Learning & Research AIK-Dense-AI/claude-scientific-skillsModel Training & Evaluation
ST

Stable Baselines3

Maintainer K-Dense Inc. · Last updated April 1, 2026

Stable Baselines3 (SB3) is a PyTorch-based library providing reliable implementations of reinforcement learning algorithms. This skill provides comprehensive guidance for training RL agents, creating custom environments, implementing callbacks, and optimi.

Claude CodeTrainingEvaluationstable-baselines3machine-learningpackagemachine learning & deep learning

Original source

K-Dense-AI/claude-scientific-skills

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/stable-baselines3

Maintainer
K-Dense Inc.
License
MIT license
Last updated
April 1, 2026

Skill Snapshot

Key Details From SKILL.md

2 min

Key Notes

  • Stable Baselines3 (SB3) is a PyTorch-based library providing reliable implementations of reinforcement learning algorithms. This skill provides comprehensive guidance for training RL agents, creating custom environments, implementing callbacks, and optimizing training workflows using SB3's unified API.
  • env = gym.make("CartPole-v1").

Source Doc

Excerpt From SKILL.md

1. Training RL Agents

Basic Training Pattern:

import gymnasium as gym
from stable_baselines3 import PPO

## Initialize agent

model = PPO("MlpPolicy", env, verbose=1)

## Train the agent

model.learn(total_timesteps=10000)

Use cases

  • Use for standard RL experiments, quick prototyping, and well-documented algorithm implementations.

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.

Related skills

Related skills

Back to directory
BI
Training & EvalMachine Learning & Research AI

bio-epitranscriptomics-m6anet-analysis

Nanopore direct RNA m6A detection with m6Anet deep learning.

OpenClawNanoClawTraining
FreedomIntelligence/OpenClaw-Medical-SkillsView
BI
Training & EvalMachine Learning & Research AI

bio-imaging-mass-cytometry-interactive-annotation

Interactive cell type annotation for IMC data. Covers napari-based annotation, marker-guided labeling, training data generation, and annotat…

OpenClawNanoClawTraining
FreedomIntelligence/OpenClaw-Medical-SkillsView
BI
Training & EvalMachine Learning & Research AI

bio-immunoinformatics-tcr-epitope-binding

Predict TCR-epitope specificity using ERGO-II and deep learning models for T-cell receptor antigen recognition. Match TCRs to their cognate…

OpenClawNanoClawTraining
FreedomIntelligence/OpenClaw-Medical-SkillsView
CI
Training & EvalMachine Learning & Research AI

cirq

Cirq is Google Quantum AI's open-source framework for designing, simulating, and running quantum circuits on quantum computers and simulator…

Claude CodeTraining
K-Dense-AI/claude-scientific-skillsView